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NOTATION  AND  NUMBERING 


We  define  some  notation  iidiich  may  not  be  con^iletely  standard. 
All  points  and  sets  are  in  unless  indicated  otherwise. 

1)  e"  = {x  e e“|x  > 0} 

n “ 

2)  For  X,  y e E , denote  their  inner  product  by  (x,y)  = E x.y. . 

i=l  ^ ^ 

3)  Let  ||x||  = {x,x)^'^^  be  the  norm  of  x. 

4)  Denote  by  d(*,*)  the  distance  function  defined  as 

d(A,B)  = inf  ||x-y|| 
xEA 
y€B 

5)  Let  the  ball  about  A of  radius  e , \diere  A may  be  a point  or 


a set,  be 


B(A,e)  = {x  € E“|d(x,A)  < e) 


6)  For  any  positive  integer  m,  let  m = {1,2, . . , ,m) . If  m = 0, 


then  m = 


7)  conv[A^,A2, . . . ,A^]  is  the  convex  hull  of  the  sets  (or  points) 


^1’  ^2’  ‘ ’ ^n' 


8)  E is  the  space  of  all  real-valued  matrices  with  n rows 

and  m columns . 


9)  If  a c n and  p c m are  index  sets,  then,  for  A € E*^^, 
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A consists  of  the  rows  of  A indexed  by  a,  and  i 

a,  • , 

A consists  of  the  columns  of  A indexed  by  u 
• > u 

A Is  the  submatrix  of  elements  whose  indices  are  in  a X u. 
o,  u 

10)  For  two  sets  A and  B, 

A\  B = {x|xeA,  x^B] 
and 

A-B  = {z|z  = x-y  for  some  x £ A,  auid  y £ B} , 

11)  The  boundary  of  a set  C is  called  dc,  and  the  Interior  of  set 
C is  called  C^. 

Numbering 

The  chapters  are  numbered  by  Roman  minerals  and  the  sections 
are  numbered  consecutively  within  each  chapter,  i.e.,  II. 1,  II. 2,  etc. 

All  theorems,  lemmas,  and  examples  are  niunbered  consecutively  within 
each  section.  Equations  are  numbered  separately  and  are  identified  | 

by  being  enclosed  in  parentheses.  Equation  (V.1.2)  is  the  second  { 

equation  in  Section  V.l,  for  instance.  The  chapter  n\imeral  is  omitted  ) 

for  results  or  equations  referred  to  within  the  same  chapter. 
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ABSTRACT  i 

I 

j 

The  problem  considered  is  that  of  finding  a solution  to  a system  j 

of  nonlinear  equations  subject  to  some  auxiliary  constraints.  The  | 

methods  studied  here  are  called  path  methods,  also  referred  to  as  | 

"continuation"  or  "global  Newton"  methods,  for  solving  equations.  A | 

general  theory  is  developed  which  unifies  the  results  from  several 
papers  and  allows  new  methods  to  be  analyzed  easily.  The  new  methods 
are  shown  to  converge  under  more  general  boundary  and  monotonicity 
conditions  than  those  assumed  for  the  existing  methods.  A rigorous 
proof  of  convergence  is  given  for  an  algorithm  which  implements  a 
general  path  method. 
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ON  THE  SOLUTION  OF  NONLINEAR  EQUATIONS  BY  PATH  METHODS 


CHAPTER  I 
GENERAL 


I.l.  Introduction  and  Summary 

The  problem  we  consider  is  that  of  finding  a solution  to  a system 
of  nonlinear  equations  subject  to  some  auxiliary  constraints.  The 
objective  is  to  find  methods  for  the  solution  of  such  systems  which  are 
convergent  under  rather  general  conditions  and  are  efficient  con^uta- 
tionally. 

The  predominate  efforts  in  studying  this  problem  in  recent  years 
have  made  use  of  "piecewise  linear"  or  "simplicial  methods"  for  solving 
systems  of  equations  (cf.  Scarf  [1967a],  Kuhn  [1968],  Eaves  and  Saigal 
[1972],  and  Merrill  [1972]).  But  a different  approach  to  the  equation- 
solving problem  has  been  revived  recently.  Kellogg,  Li  and  Yorke  [1976] 
and  S.  Smale  [19/’6]  have  proposed  "continuation"  or  "global  Newton" 
methods  for  solving  systems  of  equations.  The  history  of  "continuation" 
methods  goes  back  to  Lahaye  [195*+],  [1938]  in  the  univariate  case  and 
Davidenko  [1953a, b]  in  the  multivariate  case.  We  hope  to  elucidate 
the  common  features  of  these  methods  and  to  propose  some  new  methods 
which  converge  under  more  general  assumptions. 

This  report  contains  approximately  the  first  half  of  the 
author*  s dissertation.  The  Computation  of  Equilibria  by  Path  Methods. 
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Many  of  the  theorems  and  procedures  presented  here  are  applied  to  the 
development  of  algorithms  for  the  economic  equilibrium  problem.  When 
we  refer  to  results  or  chapters  in  the  report  which  contains  the  rest 
of  the  dissertation  (Elken  [1977]),  we  will  refer  to  them  as  being 
in  Part  2. 

In  the  remainder  of  this  chapter,  we  will  discuss  the  equation 
solving  problem  further  and  motivate  the  development  of  path  methods 
for  solving  systems  of  equations. 

In  Chapter  II  we  review  some  basic  results  from  differential 
topology  and  introduce  a unified  theoretical  framework  for  dealing  with 
path  methods.  The  reader  familiar  with  the  differential  topology  may 
want  to  skip  the  first  section  of  this  chapter.  The  theory  of  sub- 
divided complexes  which  follows  helps  to  provide  new  (and  easier) 
proofs  for  some  theorems  vrftiich  demonstrate  the  existence  of  paths  from 
a known  starting  point  to  a solution  point  for  a system  of  equations. 

In  Chapter  III  results  are  derived  which  can  be  applied  to  any 
path  method.  The  primary  concern  is  to  develop  results  which  show  that 
we  can  determine  a consistent  orientation  for  a path  and  that  a discrete 
algorithm  can  be  devised  which  is  gusiranteed  to  follow  a path  from 
one  boundary  of  a bounded  set  to  another  boundary.  This  convergence 
proof  is  a new  result. 

Chapter  TV  presents  the  existing  path  methods  of  Kellogg,  Li, 
and  Yorke  [1976]  and  Smale  [1976]  in  a homotopy  framework  and  new  proofs 
that  the  paths  have  the  desired  characteristics.  Also  new  path  methods 
are  developed  vdiich  are  motivated  by  existing  path  methods  or  fixed 
point  algorithms.  These  are  shown  to  be  convergent  under  more  general 
conditions  than  the  existing  path  methods. 
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We  are  concerned  with  proving  theorems  which  demonstrate  the 
existence  of  a path  from  a known  starting  point  € D c:  to  a 

solution  X*  of  the  problem 

f(x)  = 0 , X € D (2.1) 


where  f : E°  and  D satisfy  certain  conditions.  The  existence 

of  a differentiable  path  from  x^  to  x*  can  motivate  any  number  of 
algorithms  suitable  for  computation  of  x*.  We  will  present  several. 

First,  however,  we  discuss  other  approaches  to  studying  the  problem  (2.1). 

The  history  of  the  study  of  the  existence  question  and  algorithms 

for  finding  a solution  of  (2.1)  is  long  and  cannot  be  reported  in  full 
here.  The  most  celebrated  existence  theorem  is  the  Brouwer  fixed  point 
theorem  (Milnor  [1965]):  Let  g ; D ->  D be  continuous  and  D a 
con5)act  convex  subset  of  E*^.  Then  there  is  a point  x £ D with 
g(x)  = X.  With  a preliminary  definition  we  can  state  an  equivalent 
theorem  in  terms  of  (2.1).  We  say  that  f(x)  points  into  D for 
X £ SD  if  there  is  an  a > 0 such  that  for  P £ (0,a),  x + Pf(x)  £ D. 

The  equivalent  theorem  is;  Let  f ; D -»e’^  be  continuous  and  D be 
compact  and  convex.  If  f(x)  points  into  D for  each  x £ 3D  , then 
there  is  an  x such  that  f(x)  = 0.  The  equivalence  is  easily  seen 
when  one  notes  that  solving  g(x)  = x is  equivalent  to  solving 
f(x)  = 0 when  f(x)  = g(x)  - x and  that  solving  f(x)  =0  is  equivalent 
to  finding  a fixed  point  of  g when  g(x)  = x + Pf(x)  for  some  P > 0. 
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These  theorems  indicate  that  in  some  sense  fixed  point  problems 


and  equation  solving  problems  are  equivalent.  In  this  thesis  the 
discussion  will  be  concerned  with  problems  of  the  form  (2.1)  because 
the  formulas  involved  in  the  theorems  and  algorithms  are,  in  general, 
easier  to  state  in  this  framework. 

The  algorithms  for  solving  (2.1)  fall  into  three  general  classes; 

1)  Iterative  methods  such  as  Newton's  method  and  its  variants. 

2)  Piecewise  linear  methods.  Those  methods  generally  utilize  a tri- 
angulation of  D or  some  set  containing  D. 

3)  Path  methods,  also  described  as  "continuation  methods,"  or  "global 
Newton  methods." 

Iterative  methods  are  primarily  of  the  following  form: 
given  x^,  let  x^^^  = x^  - B^f(x^)  , k = 0, 1, . . . (2.2) 

where  B is  either  f ' (x  ) , an  approximation  thereof,  or  a matrix 
vdiich  behaves  like  f ' (x)  ^ on  some  subspace.  The  advantages  of  these 
methods  me  that  they  generally  provide  rapid  convergence  to  a solution 
when  x^  is  in  some  neighborhood  of  a solution  x*  to  (2.1).  On  the 
other  hand  the  only  convergence  theorems  with  reasonable  conditions  on  f 
are  local  results:  if  S is  a small  enough  neighborhood  of  x*  then  x^  £ S 
implies  that  (2.2)  is  a convergent  sequence. 

The  piecewise  linear  methods  are  also  referred  to  as  "com- 
plementary pivot  methods,"  "fixed  point  methods,"  or  "simplicial  methods." 
The  pioneering  work  in  this  field  was  due  to  Lemke  [1965],  Scarf  [1967a], 
and  Kuhn  [1^)68].  The  theory  has  had  a host  of  contributors  including  Eaves, 
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contained  in  Eave's  "A  Short  Course  in  Solving  Equations  with  P.  L. 

Homotopies"  [1976].  This  method  solves  problems  of  the  form  (2.1) 
by  considering  a piecewise  linear  approximation  g;D  -♦  E*'  of  f,  then 
adding  a parameter  e c [0,T],  0 < T < + <»  and  a fixed  vector  d c E° 
so  that  the  problem 

F(x,e)  = g(x)  + 0d  = 0 (2.3) 

has  a trivial  and  imique  solution  for  0 = T (or  0 sufficiently  large). 

If  the  collection  of  pieces  9?^  on  which  g is  linear  and  g(x)  satisfy 
certain  properties,  then  one  can  follow  a piecewise  linear  path  from 

(x^,T)  to  (x*,0)  defined  by  (x, 0)  £ F ^(O).  Then  g(x*)  = 0 and 
X*  is  an  approximation  to  the  solution  of  f(x)  =0  in  the  sense 
that  Ilf  (x*)  II  is  small. 

The  advantage  of  these  methods  is  that  they  do  not  require 
that  f be  differentiable.  In  fact  f can  be  a point  to  set  mapping 
which  has  a closed  graph.  Thus,  Kakutani  fixed  point  problems  (Kakutajii, 

[19^7])  can  be  solved.  This  is  important  because  often  it  is  convenient, 
if  not  necessary,  to  formulate  some  equilibrium  and  some  optimization 
problems  as  Kakutani  fixed  point  problems  (see  Scarf  [1973]  and  Eaves  I 

[1971a]).  The  disadvantages  are  that  often  the  triangulation  schemes 
are  difficult  to  arrive  at,  and  for  accurate  approximation,  the  grid 
size  may  need  to  be  so  small  that  the  convergence  of  the  algorithm 
becomes  too  slow. 

The  path  methods,  which  we  shall  concentrate  on  in  this  work, 
have  quite  a recent  history  also.  They  are  very  similar,  in  principle, 
to  the  piecewise  linear  methods  in  that  one  considers  a system  of 


equations  such  as 
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F(x,  (S)  = f(x)  + 6d  = 0 , 


X e S X [0,T]  , 


I 


i 
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for  example.  F is  defined  so  that  the  solution  is  trivial  when 
© = T,  and  under  certain  conditions  F~^(0)  = ( (x, 0) |f(x, 0)  = 0} 
defines  a path,  a set  homeomorphic  to  an  interval  in  R^,  from 
(x^,T)  to  (x*,0).  The  primary  difference  in  the  requirements  of 
path  methods  and  piecewise  linear  methods  is  that  in  the  path  methods 
we  require  that  f be  twice  continuously  differentiable  (C  ) rather 
than  merely  a closed  point-to-set  map. 

The  contributors  to  this  theory  fall  into  two  groups.  The 
work  before  1975  (Lahaye  [19^3],  [I9W],  Davidenko  [ 1955a, b], 

Freudenstein  and  Roth  [1965],  Meyer  [I968],  to  name  a few)  was  confined 
to  studying  continuation  methods  which  were  usually  based  upon  a homo- 
topy  H;D  X [0,1]  -»  E where  H(x,  1)  has  a trivial  solution  and 
H(x,0)  is  a solution  to  f(x)  = 0,  x € D.  A typical  formulation  would 
let 

H(x,t)  = f(x)  + tf(x^)  = 0,  for  some  x^  £ S. 

The  conditions  on  f which  these  authors  imposed  were  generally  quite 
strong.  A typical  assumption  would  be  that  f is  norm  coercive 
(!|x^||  -♦  00  implies  that  ||f(x^)||  -*  »)  and  that  f (x)  is  nonsingular 
everywhere.  These  conditions  would  imply  that  H ^(O)  is  connected  and 
monotone  in  t.  This  assumption  is  quite  difficult  to  check  and  is  un- 
likely to  be  satisfied  in  many  problems. 

In  1976  papers  were  published  by  Kellogg,  Li,  and  Yorke,  S.  Smale, 

and  R.  Wilson  all  proposing  methods  for  defining  paths  which  could  be 

followed  from  a known  starting  point  to  a solution.  The  first  two 
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methods  are  for  general  equation  solving  problems,  while  Wilson' s is 
for  solving  a piecewise  linear  approximation  of  a Walrasian  general 
equilibrium  model.  The  advantages  of  the  more  recent  approaches  are 
that  no  connectedness  or  monotonicity  are  required  in  and 

the  structure  of  the  set  S is  used  to  give  conditions  for  success 
which  are  easier  to  verify  than  those  for  most  continuation  methods. 

The  favorable  attributes  of  the  path  methods  are  that  global 
convergence  can  be  guaranteed  for  a large  class  of  problems,  no 
artificial  triangulations  need  be  defined,  and  efficient  iterative 
equation  solving  routines  can  easily  be  implemented  when  the  solution 

is  nearby.  Drawbacks  include  the  requirement  that  f be  at  least 

2 

C and  that  it  is  a difficult  problem  to  design  a computer  algorithm 
which  follows  a nonlinear  curve  in  K ^ This  last  difficulty  is  not 
as  overwhelming  as  one  might  think.  An  algorithm  which  is  guaranteed 
to  stay  near  a particular  curve  will  probably  be  slow,  in  general,  but 
algorithms  have  been  developed  which  approximately  follow  a path  and 
converge  to  the  solution  quite  quickly.  Kellogg,  Li,  and  Yorke  [1976] 
give  such  an  algorithm  with  numerical  results  and  Chapter  VIII  shows 
that  path  methods  provide  efficient  means  to  calculating  equilibrium 
points  as  well. 

Another  similarity  in  the  papers  of  Kellogg,  Li,  and  Yorke, 
Smale,  and  Wilson  was  the  use  of  Sard' s Theorem,  and  other  results  from 
the  theory  of  differential  topology  to  prove  their  theorems.  Since 
the  reader  may  not  be  familiar  with  these  results,  we  present  them  in 
the  next  chapter. 
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CHAPTER  II 


DIFFERENTIAL  TOPOLOGY  AND  SUBDIVIDED  COMPLEXES 

II. 1,  Background  in  Differential  Topology 

In  order  to  prove  global  convergence  theorems  for  path  methods, 

I 

] it  is  very  useful  to  use  some  of  the  results  from  the  theory  of  differ- 

I 

I ential  topology.  In  contrast  to  analysis,  differential  topology  deals 

with  the  properties  of  sets  and  functions  "in  the  whole"  as  well  as 

locally.  This  branch  of  mathematics  explores  the  relationship  of 

functions  f:M  ->  N and  the  sets  f”^(y)  for  values  y € N.  The 

definitions  and  results  in  this  section  are  from  Milnor  [1965]  and 

Hirsch  fl976].  The  theorems  stated  here  can  be  found  in  either  work. 

A map  f:M  -4  N,  where  M cz  nR™  and  N e is  of  class  C^ 

th 

if  all  r order  partial  derivatives  exist  and  are  continuous.  A 
map  f is  smooth  if  it  is  of  class  C for  every  r > 1.  A map 
f :M -» N is  called  a diffeomorphism  (resp.,  C^-diffeomorphism)  if  f 
carries  M homeomorphically  onto  N and  both  f and  f”^  are  smooth 
(of  class  C^). 

Definition  1. 1.  A subset  M cz  ]R  is  called  a smooth  manifold  (C  - 
manifold)  of  dimension  m iff  each  x € M has  a neighborhood  W H M, 
where  W is  an  open  subset  of  3R  , which  is  diffeomorphic  (C  -diffeo- 
morphic)  to  an  open  subset  U c IR™. 

We  shall  refer  to  a manifold  of  dimension  m as  an  m-manifold. 

I 
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Definition  1.2.  A circle  is  the  set  {x  6 3R  | ||x|l  = 1} . An  interval 
is  any  convex  set  in  1?  ^ containing  more  than  one  point. 

I* 

Theorem  1.3.  Any  C 1-manifold  is  a disjoint  union  of  connected  sets 
each  of  which  is  C^-diffeomorphic  to  either  a circle  or  an  interval  of 
real  numbers,  r > 1. 


y • 


The  usefulness  of  this  result  is  that  if  we  know  that  a set 
f (u)  is  a C 1-manifold,  and  x is  a boundary  point  of  that  manifold, 
then,  if  one  could  follow  that  connected  con^onent  of  f“^(u)  with  x®  as  a 
boundary  point,  one  would  never  return  to  a point  which  has  already  been 
visited.  Thus,  the  component  of  f''^(u)  must  either  have  one  other 
boundary  point  x*  or  it  must  be  diffeomorphic  to  a ray.  The  importance 
of  this  property  for  the  convergence  proofs  presented  below  is  analogous 
to  the  importance  of  the  celebrated  "Lemke  argument"  (Lemke  [1965], 

Scarf  [1975],  Eaves  [1976])  in  the  theory  of  solving  piecewise  linear 
equations. 

To  define  the  notion  f (x)  for  a map  f : M -» N of 

manifolds,  r > 1,  we  first  associate  with  each  x 6 M e ]R  a linear 
subspace  TM^  c IR  of  dimension  m called  the  tangent  space  of  M 

at  X.  Then  f (x)  will  be  a linear  mapping  from  TM  to  TM  ; 

X y 

where  y = f (x) . 

Ir  V 

For  an  open  set  U c E , TU^  = E . For  any  C map  f ; U V 
the  Jacobian 

f'(x)  = E^ 
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is  defined  as  the  matrix  of  pajrtial  derivatives,  i.e.. 


I 

I 

for  i e j 6 k. 

Now  let  us  define  the  tangent  space  TM^  for  an  arbitrary,  smooth, 
m-dimensional  manifold  M c ]R  . Choose  a parametrization 

g ; U -»  M c:  ]R  ^ 

of  a neighborhood  g(U)  of  x £ M,  with  g(u)  = x.  Here  U is  an 

m k 

open  subset  of  ]R  . Thinking  of  g as  a mapping  from  U to  ]R  , 

the  derivative 

g’  (u):]r“  -» 

is  defined.  Set  TM  equal  to  the  image  of  g'(u). 

' i 

j 

We  must  prove  that  this  construction  does  not  depend  upon  the  ^ 

It 

particular  choice  of  parametrization  g.  Let  h:  V -»M  c=  E , 

X £ h(X)  c M,  V = h ^(x)  , ; 

h ^ “ g:U^  ^ u -» ^ V . 


The  commutative  diagram  of  maps 


f’(x) 


id 


Sx . 


f (y);3R“  ^ k” 


has  rank  less  than  n.  Then  C will  be  called  the  set  of  critical 
points , M\C  the  set  of  regular  points,  f(C)  the  set  of  critical 
values , 8uid  N\  f(C)  the  set  of  regular  values  of  f. 

The  following  regular  value  theorem  is  often  used  to  define 
manifolds  (or  in  particular,  paths). 

Theorem  1.4.  Let  f =M  -»  N be  a map,  r > 1.  If  y € N is  a 

regular  value,  then  the  set  f ^(y)  c M is  a manifold  of 

dimension  m-n. 
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The  following  result  makes  precise  the  statement,  if  y is 

•I  r 

in  the  range  of  f,  then  "in  general"  f (y)  is  a C manifold. 
This  theorem  is  critical  in  proving  the  validity  of  path  methods. 


Theorem  1.^’.  (Sard's  Theorem)  Let  U cIR”  be  open,  f:U 
a map  and  C = {x  £ ujrank  f ' (x)  < n) . If  r > max{0,m-n} 

then  f(C)  has  Lebesgue  measure  zero. 

I 

I 

Since  we  are  concerned  primarily  with  1-manifolds  defined  by  ! 

f~^(y)  for  some  y £ N,  the  weakest  differentiability  condition  that 
can  be  specified  for  f , and  still  be  able  to  apply  Sard' s Theorem, 

I 

is  that  f be  C^. 

i 

j 

Definition  1.6.  Consider  the  half-space  = {(x,,...,x  ) £ ]r”|x_>0).  ; 

' " m n — 

The  boundary  Slf"  is  A subset  X c;  ]R^  is  called 

a manifold  with  boundary  if  each  x £ X has  a neighborhood 

U n X C^-diffeomorphic  to  an  open  subset  of  h'”.  The  boundary  SX 
is  the  set  of  all  points  in  X which  correspond  to  points  of  dH*" 
under  such  a diffeomorphism. 

Now  consider  a C map  f ; X -»  N,  r > 1,  from  an  m-manifold 
with  boundary  to  an  n-manifold  where  m > n. 


Theorem  l.Y.  If  y £ N is  a regular  value  for  both  f and  for  the 
restriction  f |3X,  then  f"  (y)  c:  X is  a C (m-n) -manifold  with  boundary. 
Furthermore,  S(f  ^(y))  is  precisely  equal  to  f ^(y)  n bX. 


This  is  the  form  of  the  regular  value  theorem  which  will  be 


I 

t . 


I 


I 


most  useful  in  proving  the  global  characteristics  of  certain  paths 
defined  in  the  chapters  below. 

II. 2.  Orientation 

In  this  section  we  present  some  standard  notions  of  orientation 
vdiich  will  be  useful  to  use  when  we  prove  results  concerning  the 
orientation  of  paths  in  Section  III. 2.  The  following  definitions  are 
again  an  sunalgamation  of  those  in  Milner  [I965]  and  Hirsch  [1976]. 

Definitions.  An  orientation  for  a finite  dimensional  real  vector  space 

V is  an  equivalence  class  of  ordered  bases  as  follows:  The  ordered 

basis  (b, b ) = B determines  the  same  orientation  as  the  basis 

(b^, . . . , b^)  = B’  if  B'  = BA  where  A is  an  n x n matrix  with 

det  A > 0.  It  determines  the  opposite  orientation  if  det  A < 0. 

The  vector  space  3R  has  a standard  orientation  corresponding  to  the 

basis  (e, e ) where  e.  is  the  i ^ unit  vector  denoted  by  u)  . 

I n 1 n 

In  the  case  of  a zero  dimensional  vector  space  it  is  convenient 
to  define  an  "orientation"  as  the  symbol  +1  or  -1. 

We  shall  denote  an  orientation  co  of  V by  to  = [b^, ...,  b^] 
as  the  equivalence  class  of  bases  with  the  same  orientation  as  V. 

(V,to)  is  an  oriented  vector  space,  -to  denotes  the  opposite  orienta- 
tion to  to. 
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If  L V -» W is  an  isomorphism  of  vector  spaces  and 

(JO  = [e, e ] is  an  orientation  of  V then  L(a))  = [Le, Le  ] 

I n I n 

is  the  induced  orientation  of  W.  Given  (V,(jo)  and  (W,(jo'),  an 

isomorphism  L:V  -» W is  called  orientation  preserving  if  L(co)  = (o'  ; 

otherwise  L is  orientation  reversing. 

An  oriented  manifold  consists  of  a V smooth  manifold  M 

together  with  a choice  of  orientation  (o^  for  each  tangent  space 

TM^.  If  m > 1 these  are  required  to  fit  together  as  follows;  For 

each  X € M there  should  exist  a neighborhood  U c M and  a diffeo- 

morphism  h mapping  U onto  an  open  subset  of  3R  such  that 

h'  (x)  (o)  ) = (cn  ) for  each  x € U,  If  M is  connected  and  orientable 
X m 

then  it  has  precisely  two  orientations. 

If  M has  a boundary  then  we  can  distinguish  three  kinds  of 


vectors  in  the  tangent  space  TM^  at  a boundary  point; 

1)  There  are  the  vectors  tangent  to  the  boundary,  forming  an  (m-1)- 
dimensional  subspace  T(dM)^  c TM^. 

2)  There  are  the  "outward"  vectors,  forming  an  open  half  space  bounded 
by  T(5m)^;  and 

3)  There  are  the  "inward"  vectors  forming  a complementary  half-space. 

Each  orientation  co  for  M determines  an  orientation  for 
as  follows.  For  x £ dM  choose  a. basis  B such  that  [b^, ...,b^] 

= uj(M)  and  in  such  a way  that  b^,  b^,  ...  , contained  in 

T('3M)  and  so  that  b is  an  outward  vector.  Then  (b, ,...,b  ,] 

' ^x  m 1'  ' m-1-' 

= (jd^(3M),  the  induced  orientation  of  at  x. 


In  this  section  we  will  essentially  be  extending  the  piecewise 


I 

1 

i 

> 


linear  topology  developed  by  Eaves  [1976]  to  the  case  vdien  the  functions 

p 

are  C on  each  "cell"  ajid  the  cells  are  defined  by  an  intersection  of 
a finite  number  of  manifolds  with  boundary. 

A set  cr  c:  3R  is  a cell  if  o is  non-empty  and  satisfies 

a = {x  € ]R'^|bj^(x)  <0,  i € £;  b^(x)  =0;  i € £ \ £}  , 

n ? 

for  some  p and  q,  0 < q < p where  b^  = ]R  -»  IR  are  C functions. 
The  interior  relative  to  the  manifold  {x  £ ]R*^(b^(x)  = 0;  i £ £\  a) 
denoted  by 

= {x  £ K^lbjx)  <0,  i £ £;  b^(x)  = 0,  i £ £\£] 
is  also  assumed  to  be  non-empty.  Let 

A(x)  = [i|b^(x)  =0,  i £ £}  for  x £ a 

be  the  set  of  binding  constraints  of  x relative  to  a. 

We  shall  always  assume  that,  for  x £ a,  that  {7bj^(x)  | i £ A(x) } 

» 

is  a linearly  ind^endent  set  of  vectors  (if  A(x)  4 90.  This  is  a 

commonly  used  constraint  qualification  in  nonlinear  programming. 

• » 

A cell  0 c:  iP'  is  called  a cell  of  dimension  m,  or  an  m-cell, 
if  each  X € (p  has  a neighborhood  W n a which  is  diffeomorphic  to 
an  open  subset  U of  IR™. 

Sxippose  the  index  set  B c;  £ is  equal  to  A(x)  .for  some 
X £ a.  Then 


l6 


P H {x|bj^(x)  < 0,  i e qV,  b^(x)  =0,  i e B) 


is  a face  of  ct.  Notice  that  B r>  £\  B is  called  the  index  set 
associated  with  P.  Also  note  that  P could  be  all  of  a or  a 
single  point. 

Lemma  3,1.  If  a face  p c ct  has  an  associated  index  set  B,  then  let 
r = |b(.  Then  if  a c r < n and  p is  a cell  of  dimension  n-r. 

2 

Proof.  Let  S z)  or  be  a smooth  manifold.  Define  the  C map 

h : S as  h(x)  = | bj^(x)  \ 

' : ' i€B. 

Then,  by  definition  of  ct,  for  any  x £ P,  {Vb^(x) | i € B)  is  a linearly 
independent  set,  or  equivalently,  h’ (x)  is  of  reuik  r for  any  x £ P. 

Thus,  0 is  a regular  value  of  h,  and  by  Theorem  2.k,  h"^(0)  is  a 

2 "1 
C manifold  of  dimension  of  dimension  n-r.  Hence,  P = h (O)  n cr 

is  an  (n-r) -cell.  | 

An  (m-l)-face  of  an  m-cell  is  called  a facet  of  the  cell. 

Let  ^ 5^  cp  be  a finite  or  countable  collection  of  m-cells  in 
some  Euclidean  space.  Let  ^7^^  for  i = 0, . . . , m be  the  set  of  i-faces 
of  the  elements  ofTtj',  we  call  members  of 
cells  ajid  vertices  of  07^  , respectively. 
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Definition  3.2.  Let  M = U cr.  We  call  (M,73^)  a subdivided  m-complex 
if 

a)  any  two  m-cells  of  ^ are  disjoint  or  meet  in  a common  face, 

b)  each  (m-l)-cell  of  lies  in  at  most  two  m-cells,  and 

c)  each  point  of  M has  a neighborhood  meeting  only  finitely  many 
m-cells  of 

Example  3.3.  Let  cr  be  an  m-cell,  then  (M,7?y)  = (ct,  {a})  is  a sub- 
divided m-complex. 

The  next  result  will  be  useful  in  the  definition  of  a path 
method  in  Section  IV. 3. 

Proposition  3. If  7n=  t T^|i  € q } is  the  collection  of  facets, 
of  an  m-cell  ct,  then  ( U t.,%7)  is  a subdivided  (m-1) -complex. 

i€a  ^ 

Proof.  For  simplicity  consider  a cell  a which  is  defined  only  by 

inequality  constraints:  cr  = {x  € ]R"^|b^(x)  <0,  i £ £} . The  cells 

T.  are  defined  as 
1 

= {x  € E^lbj^Cx)  = 0,  bj(x)  < 0,  V j ^ i)  • 

a)  To  prove  that  the  first  part  of  Definition  3.2  holds,  it  is  sufficient 
to  show  that  0 ^ >+).  Suppose  y £ t?  D for  some  j,  k € £. 

Then  identical  in  some  (ra-1) -dimensional 
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I I 


neighborhood  B(y, e)  n t?.  This  implies  that  Vb.(y)  = cUTb  (y)  for 

J J 

some  a > 0,  which  contradicts  the  constraint  qualification  in  the 
definition  of  an  m-cell. 

Next  we  show  that  each  (m-2)"Cell  lies  in  at  most  two  (m-l) -cells  of  77J- 
Suppose  that  an  (m-2)-cell  y lies  in  G ^ . Then 

r <=  6 = {x  e ]R”|b^(x)  =0,  i e 3,  b^(x)  < 0,  i G £\  3).  At 
some  point  y G there  is  an  e > 0 such  that  y n B(y,  e) 
is  an  (m-2) -manifold.  But  by  Lemma,  3.1,  5 is  a cell  of  dimension 

n-3,  and  we  have  a contradiction. 

c)  This  last  requirement  follows  from  the  fact  that  the  number  of 
cells  in  is  finite.  | 

3 

Example  3.3.  Consider  three  2-cells  in  ]R  which  meet  in  a common 
facet.  Then  2 ^2*  ^3^  ^ subdivided  2-complex. 


r 1 


is  a.  subdivided  ni— compLex  for  some  subdivision 

; 9^,  we  call  M an  m-complex;  M may  have  many  possible  subdivisions. 

I The  following  lemma  is  trivial  to  prove,  but  it  is  very  important 

for  the  convergence  theorems  that  follow. 

F [ 


Lemma  3.6.  A connected  1-complex  is  homeomorphic  to  either  a circle 
or  an  interval. 

For  the  two  cases  of  5.6  we  call  the  connected  1-complex  a loop 
or  a path.  We  shall  use  path  and  curve  synonymously. 


ii 
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Let  M be  an  m-complex  subdivided  by  . By  the  boundary 

of  M,  3M,  we  mean  the  union  of  all  (m-1) -cells  of  ^ which  lie  in 

exactly  one  m-cell  of  9?^ . 

Lemma  3.9.  The  boundary  of  a complex  is  closed  in  the  complex, 
k k 

Proof.  Let  [x  ) a dM  be  such  that  lim  x = x.  If  infinitely  majiy 

k 

of  the  x^  lie  in  one  m-cel].  of  , then  x is  in  cr  and  hence 

in  M.  Otherwise,  x is  not  in  M due  to  condition  (c)  in  the  definition 

of  a subdivided  complex.  | 

We  are  interested  in  the  behavior  of  1-complexes  which  are  con- 
tained in  m-complexes  for  m > 1.  When  a path  has  certain  desirable 
properties  we  shall  say  that  it  is  "neat."  Let  M be  an  m-complex 
and  W a 1-complex  contained  in  M.  If  W is  closed  in  M and 
dW  = W n Sm  we  say  that  W is  neat  in  M.  This  terminology  is  used 
by  Hirsch  [1976]  with  reference  to  manifolds  with  boundary.  Some 
examples  illustrating  the  definition  are  below. 

Example  3.10.  The  1-complex  W (dashed  line)  is  neat  in  M in  the 
first  case,  not  neat  in  M for  the  others. 


1. 
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2. 


Next  we  extend  the  definition  of  "neat"  1-complexes  to  deal 
with  subdivisions  of  M. 

Definition  3.11.  Suppose  that  W is  a 1-complex  contained  in  M and  ^ 
is  a subdivision  of  M.  Let  'W  be  composed  of  loops  and  paths 
r c (T  n W for  each  cr  c 9/^,  We  say  that  W is  neat  in  if 

a)  W n CT  is  neat  in  a for  any  a € 9?^  , 

b)  is  a subdivided  1-complex,  and 

^ 0 

c)  d(w  n 0")  c U T where  {x,  |i  £ l)  is  the  set  of  (m-l)-cells  of 

i=l  ^ ^ ~ 

iTi  OrT-h. 

Condition  c)  means  that  when  W hits  ^cr,  for  any  cell  a, 
it  hits  only  one  facet  of  a. 

22 


i 


Lemma  3.12.  If  W is  neat  in  (M,  then  W is  neat  in  M. 

Proof.  Part  a)  of  the  definition  implies  the  result.  Since  W D a 
is  closed  in  ct  for  any  ct  G % W is  closed  in  M.  If  x € W D ^M, 
then  X € W n dcr  for  one  cell  a.  Thus,  x G ^ (w  D a)  for  one  cell, 
and  we  have  x G dw.  If  x G dw  and  x is  in  only  one  cell  a,  it 
is  clear  that  x G W D dM.  If  x G 3w  and  x G n then  for  one 
of  the  cells,  say  a^,  W n Cg  contains  a connected  component  con- 
sisting of  the  point  x.  This  contradicts  the  requirement  that  W D cr, 

i 

is  a 1-complex.  So,  = W n 5m.  | 

Example  3.13.  W is  not  neat  in  (M,9j^)»  W n is  not  in  a^, 

L = i 


> 


/ 


Example  3.14.  W is  neat  in  (M, 


^ W 
\ 

\ 

I 

/ 


Next  we  prove  the  theorem  which  says  that  for  almost  all 


values  cp,  f”^(cp)  is  neat  in  o’. 


Theorem  3.1^.  Consider  f:a  IR  where  o is  a bounded  m-cell 
and  f is  C^.  There  is  a closed  set  of  measure  zero  Z cz 
such  that  for  any  cp  € f(o-)\  Z,  f”^(cp)  is  neat  in  (a, {a)). 

Proof.  Let  {x^ji  € £}  be  the  set  of  facets  of  cr,  and 

{T^|i  = q+1, be  all  r-faces  of  a for  r < m-1.  x^  is  a 

compact  set  of  dimension  less  than  m-1  for  j € £\  £.  Let 
ni*l- 

vj  =f(x  ) cz  E “ be  the  image  of  x.  under  f for  J ^ i,\ 

V J «J 

V.  is  a closed  set  of  dimension  less  than  m-1.  Hence,  v = Uj^.. 

J j€  £ 

ni*l 

is  a closed  set  of  measure  zero  in  E , and  if  cp  ^ v,  then 

-1  ^ n 

f "^(0;)  n 3cr  c U x;. 


Since  i € £ is  contained  in  {x  c ]R'^|b^(x)  =0, 

bj(x)  =0,  j e £\  g],  a -(ra-1) -manifold,  there  is  a compact  C^- 
manifold  with  boundary  U such  that  c;  dx^  euid  cr  c:  X^. 

By  Theorem  2.5  (Sard's)  auid  Theorem  2.7, the  set  c;  of 

critical  values  for  f|Xj^  and  for  f|3X^  has  measure  zero  for  each 
i € £.  Also,  for  f ^(cp)  is  a 1-manifold  neat  in 

Xj  U bX^. 

To  show  that  is  closed,  let  ^ be  a convergent  sequence 

k — —1  t 

{<p  ) c together  with  its  limit  point  cp.  f (J  ) must  contain 

It  it  it 

a sequence  {x  } such  that  f(x  ) = cp  for  every  k.  Since 

it  — 

Xj^  U dX^  is  compact,  {x  ) has  a limit  point  x.  We  know  that 
f (x)  is  continuous  and  that  every  minor  of  size  m-1  is  zero  for 
X ef  Hence,  x is  a critical  point,  and,  by  continuity,  f(x)  =9 


and  5 is  in  C.. 

^ 1 

Since  i S £ was  arbitrary,  = C is  a closed  set  of 

measure  zero.  We  now  have  that  Z = C U v is  a closed  set  of  measure 


zero.  All  that  is  left  to  show  is  that  for  cp  € f(a)\  Z, 

^(f"^(cp))  = r’-Ccp)  n So. 

For  X £ da  n f ^(cp),  there  is  some  i such  that  x £ x^. 

Thus,  there  is  some  neighborhood  of  x such  that  f)  a = (9' 

= n (X^  U dXj^).  Since  cp  ^ C,  x £ n dX^^  = dfl"^(cp).  f is 

^ i i 

identical  to  f\y  on  & so  X £ df  (cp),  eind  we  have  shown 
^i 

(da  n f‘^(cp))  c df"^(cp). 

Suppose  X £ df”^(cp).  If  x £ da,  an  argument  similar  to  the 
one  above  would  show  that  x £ da  H f ^(cp).  If  x ^ da,  then  for  any 
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o 


X € but 
identical  to  f, 
the  case  x £ da 


X ^ dX  . Thus,  X 4 ('P)' 

-1  ^ 

and  we  have  x ^ f (cp),  a contradiction. 


is  the  only  one,  and  we  axe  done. 


fL  is 
*i 

Therefore, 


We  will  call  values  cp  which  are  in  f (cr)  \ Z,  as  defined 
in  the  last  theorem,  good  values  of  f with  respect  to  cr. 

In  the  theorems  that  follow,  the  assximption  is  commonly  made 
that  0 is  a good  value  of  f with  respect  to  cr.  Theorem  5 . 11 
Implies  that  if  this  assumption  is  true,  then  there  is  a neighborhood 
of  0 consisting  of  good  values.  This  is  important  from  a computational 


point  of  view  because  in  this  case  if  the  algorithm  stays  in  B(f”^(0),e), 
for  some  e > 0,  no  singularities  or  branch  points  of  a path  will  be 
encountered. 

Another  useful  fact  is  that  if  0 is  a critical  value,  then 
£iny  neighborhood  (5'  of  0 contains  a regular  value  cp.  However,  it 
is  possible  that  there  are  no  good  values  in  O'.  For  our  applications 
of  this  theory,  however,  it  will  always  be  true  that  for  some  facet  t, 
f|^(x)  = 0 will  have  a unique  solution  x®,  and  f|'^(x^)  will  be  of 
full  rank.  By  the  inverse  function  theorem,  any  sufficiently  small 
neighborhood  & of  Q will  be  in  the  range  of  f,  and,  hence,  we 
can  conclude  that  O'  contains  a good  value  cp  of  f with  respect 
to  a.  Thus,  when  we  malce  the  assumption  that  0 is  a good  value 
for  f in  the  sequel,  we  make  it  with  the  understanding  that  if  this 
is  not  true  we  can  deal  with  a perturbation  cp  of  0. 


I 
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A more  satisfactory  resolution  of  this  regularity  problem  would 
be  to  use  some  deeper  transversality  results  of  differential  topology 
(Hirsch,  Section  5.2,  [1976]).  With  these  results  we  could  perturb 
f to  f if  0 was  not  a good  value  of  f w. r.t.  a.  We  have 
avoided  this  course  because  of  the  excessive  amount  of  preliminaries 
which  it  would  require. 

It  is  important  to  note  that  if  0 is  a good  value  of  f,  then 
X C f ^(O)  n means  that  x is  in  only  one  facet  of  Hence, 

there  is  at  most  one  adjacent  cell  a2  in  which  one  could  continue  to 
follow  f"^(0) . 

We  say  that  cp  is  a good  value  of  f with  respect  to  (w. r.t.) 
(M,  Ty)  ii"  'P  is  a good  value  of  f w.r.t.  a for  every  a € 

Notice  that  if  the  subdivision  of  is  finite,  then  Theorem  5.15 
immediately  yields  the  following 

Corollary  5.16.  Let  (M, 5^)  te  a bounded  subdivided  m-complex  ajid 
\'h[\  be  finite.  If  f;M  IR®"^  is  C^,  then  there  is  a closed 
set  of  measure  zero  Z c such  that  the  set  of  good  values 

of  f w.r.t.  (.M,  ?7)  can  be  written 

® = f(M)\  Z. 

The  next  theorem  is  our  main  result  concerning  good  values. 
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Theorem  3,17.  Let  f : M TR “ he  a continuous  map  such  that  f | 

is  on  each  cr  c , where  (M,  is  a subdivided  (m+1)- 

complex,  and  each  a cz‘7l]  is  bounded.  If  cp  is  a good  value  of  f 
w. r.t.  then  f~^(<p)  is  a 1-complex  neat  in 


Proof.  Since  cp  is  a good  value  for  f | for  aav  ff  £ 

f"^(cp)  n a,  is  neat  in  (cr,  {cr])  for  each  a€  9^,  by  Theorem  3.14. 

Thus,  parts  1)  and  3)  of  Definition  3.10.5  are  demonstrated.  Let 
W = f ^(cp)  eind  be  composed  of  the  connected  components 
y c=  (cr  n W)  for  each  cr  6 0^,  All  that  is  left  to  prove  is  that 
(W,  is  a subdivided  1-complex.  We  now  show  that  the  three  conditions 

of  Definition  3.3.  are  satisfied. 

a)  Suppose  that  n (2^  for  A^,  A^  £ then  c:  0 f”^(cp) 

and  ^2  ^ ^ "the  definition  of  segments.  It  is  clear 

that  A^  cr  and  A^  cr  because  3A^  c:  da^,  i = 1,2.  From 

n cTg  = we  have  that  A^  and  A^  must  meet  in  a boiuidary 
point. 

b)  If  y is  a boundary  point  of  distinct  for  i = 1,2,3 

then  y £ c for  i G ^ vdiere  t is  a facet  of  3 

i € 3.  This  contradicts  the  fact  that  is  a subdivision  of  M, 

c)  For  any  y G A^^  cr  f"^(cp),  if  y £ cr^  for  only  one  £ ^ then 

there  is  a neighborhood  O'  of  y for  which  O r\  f ^(cp)  meets  only 

Aj^.  If  y £ cr2  also,  then  b)  implies  that  y is  a boundary  point 

of  exactly  two  segments  A^  and  A^.  Since  cp  is  a regular  value 

for  f|  , f|  , f|  „ , one  can  find  a neighborhood  of  y 

CTi  CTj^l 

containing  only  A^^  and  A^.  | 


I 


The  following  proposition  is  of  critical  importance  in  applications 
of  the  theory  of  subdivided  manifolds. 

Proposition  3.18.  If  the  subdivision  of  is  finite  and  M is 
bounded,  then  f”^(cp)  contains  an  even  number  of  boundary  points. 

Proof.  Let  |C  j denote  the  number  of  connected  ccanponents  in  the  set 

C.  To  show  that  the  subdivision  of  f ^(cp)  is  finite,  all  ve 

need  show  is  that  |f”^(cp)  n cr|  is  finite  for  any  ct  C 'hj.  Since  ’ 

cr  is  closed  and  M is  bounded,  cr  must  be  ccanpact.  Suppose  there  are 

an  infinite  number  of  distinct  connected  components  of  f ^(«p)  D a, 

{A^},  i = 1,2,...  . Then  pick  a point  € A^,  i = 1,2,...  . Since 

a is  compact,  we  can,  by  choosing  a subsequence  if  necessary, 

assume  that  lim  = x € a,  and  f(x)  - cp  by  continuity, 
i _>  00 

By  making  a extension  of  f to  an  open  set  C c:  a we  can  use  the 

fact  that  cp  is  a regular  value  of  f to  apply  the  implicit  function 

theorem  [Ortega  and  Rheinboldt  (1Q70),  p.  128]  and  conclude  that  there  is 
a neighborhood  3"  of  x and  a curve  x(t):(t^, t^)  -» C, 

t^  < 0 < tg,  x(0)  = X,  such  that  f”^(cp)  H (?' = {x(t)|t  € (t^jt^)) 
which  contradicts  the  fact  that  x was  a cluster  point  of  (x^). 

Thus  the  subdivision  of  is  finite. 

Consider  any  connected  component  y e f ^(cp) . If  y is  a loop 
it  has  no  boundary  points.  If  y has  one  boiindary  point  x^  c=  A^  c y, 
then  it  must  contain  another:  The  subdivision  of  y must  be  finite 
= {A*^,  A^,  ...  , A*^} . x*^€  A^"^  n A*^  is  a boundary  point  of  A^. 
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If  c;  a,  then  ct  fl  f ^(cp)  is  neat  in  a and  the  fact  that 
is  closed  in  the  compact  set  cr  imply  that  there  is  another  point 

is  in 


of  X*^  and  c r*^  where  t is  a facet  of  a.  x^^^ 


no  other  cell  ct' , otherwise  X^  would  not  he  the  last  segment  in 


the  connected  set  y.  Thus  x^^^  is  the  other  boundary  point  of  y. 


Since  this  argument  could  be  repeated  for  each  of  the  finite  number 
of  components  y c f ^(cp),  the  number  of  bovindary  points  of  f"'^(ip) 
is  even.  | 


Corollary  3.19.  If  x*^  is  a boTondary  point  of  f~^(q))  where  cp  is 


a good  value  and  M is  compact,  x £ SM  and  the  other  boundary  point 


X*  of  the  path  y containing  x is  in  the  boundary  of  M. 
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CHAPTER  III 


INTRODUCTION  TO  PATH  FOLLOWING  ALGORITHMS 

III.l.  Ob,1ectlves 

In  this  chapter  vre  present  some  of  the  possible  algorithms  for 
following  F”^(0)  for  some  deformation  F ; M N which  is  defined 
in  terms  of  f vdiere  M c N c ]R^.  We  will  not  discuss  the 

exact  manner  in  which  F is  defined — that  is  the  subject  of  the  next 
chapter.  We  will  merely  be  assuming  that  an  initial  point  (x^,0*^) 
is  available  which  is  in  SM  0 F ^(O),  and  if  we  can  follow  the  component 
y c f”^(0)  containing  (x^,0^)  until  we  reach  the  other  point 
(x*,  0*)  € y n 5M,  we  will  have  found  a solution  x*  to 

f(x)  = 0 , X e S . (1.1) 

We  will  assume  that  y is  a C^-diffeomorphism  of  a closed 
interval  in  IR  ^ (which  will  be  true  if  0 is  a good  value  for  F 
w. r,t.  M) . Hence,  there  is  some  differentiable  parametrization 
cp  = [0,T]  -» M such  that  y = { (x,  0)  | (x,  0)  = cp(t)  for  some  t £ [0,T]} 
and  cp(0)  = (x^,0*^).  Clearly  then,  we  have 

F((p(t))  = 0 , t G [0,T]  . (1.2) 

Differentiating  (1.2)  with  respect  to  t we  get 

P’(9(t))  • = 0 . 

Define  (I)(t)  = dcp/dt.  We  can  completely  specify  cp(t)  as  the  solution 
to  the  initial  value  problem 
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I 




(1.3a) 


F'  (cp(t))  qj(t)  = 0 , 

(1.3a) 

o 

><! 

II 

(1.3b) 

(p(t)  e M for  t > 0 , 

(1.3c) 

ll<?>(t)||  = 1 . 

(1.3d) 

(cf,  Kellogg,  Li,  and  Yorke  [1976]). 

The  condition  (l,3c)  fixes  the  sign  of  the  initial  tangent  vector 
q)(0),  and  requires  it  to  point  into  M.  The  condition  (l.5d)  implies 
that  the  parameter  t is  the  arc  length  along  the  curve.  Under  some 
rather  strong  conditions,  numerical  integration  techniques  such  as 
Euler' s method  or  the  Runge-Kutta  technique  can  be  used  to  find  the  end- 
point (x*, ©*■)  = cp(T)  (Ortega  and  Rheinboldt,  pp.  539-5^  [1970]). 
Kellogg,  Li,  and  Yorke  [1976]  describe  a similar  algorithm  under  more 
general  conditions,  but  do  not  give  any  convergence  proof. 

We  will  briefly  describe  an  algorithm  similar  to  the  one  described 
in  Kellogg,  Li,  and  Yorke,  so  that  we  cam  contrast  it  with  the  one 
for  which  we  will  prove  a convergence  theorem.  This  algorithm  takes 
short  steps  in  the  direction  of  q)(t)  as  described  in  (1.3)  and 
shortens  the  steplength  if  |1f(x,0)||  changes  too  much.  An  assumption 
of  this  algorithm  is  that  =0  suid  G is  monotonically  increasing 
along  r The  algorithm  is  as  follows: 


i, 


t 


1 
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Algorithm  1.1, 

0.  Pick  (x^,0)  € M such  that  F(x^,0) 
c > 0.  Set  i = 0. 

1.  Compute  F'(x^, 0^). 

2.  Compute  (x, 9)  such  that 


= 0,  and  pick  numbers  h > 0, 


= 0 and  0 > 0 . 


3.  Replace  (x,  0)  by  (x,  0) /||  (x,  0)  || 


4.  Set  = (x^,e^)  + h(x,0) 

5.  If  ||f(x^'^^^  0^  - F(x^,0^)li  > £,  replace  h by  h/2  and  go 

to  Step  4.  Otherwise,  go  to  next  step. 

6.  If  (x^"*"^,  0^^^)  ^ M^,  exit  with  0^^^)  as  the  approximate 

solution  to  the  problem  of  finding  F"^(0)  n 5m.  Otherwise, replace 
i by  i+1  and  return  to  Step  1. 


Clearly  the  approximate  solution  derived  from  this  algoritlim 
could  be  used  as  a starting  point  for  some  iterative  technique  such  as 
Newton's  method  to  find  a more  accurate  solution.  A possible  course 
of  Algorithm  1.1  is  given  in  Figure  1.2. 

Notice  that  after  the  third  iteration  the  stepsize  was  cut  in 
half  due  to  Step  5.  In  this  case  the  algorithm  performed  well  and 
resulted  in  a good  approximation  to  (x*,  9*) . 


( ' 


After  the  jump  from  0^)  to  (x^,0^),  F(x^,0^)  is  still 

near  zero  but  (x^,6^}  is  near  a new  component  of  F ^(O),  a loop. 

The  algorithm  will  fail  to  reach  (x*,  . Even  with  a very  small  step 
size  h and  tolerance  € it  appears  difficult  to  prove  that  this  type 
of  behavior  will  never  occur. 

The  next  difficulty  appears  even  more  disadvantageous.  It  is 
caused  by  the  naive  method  for  choosing  the  sign  of  (i, 0)  in  Step  2. 


(x*  £^) 


At  the  point  (x^,0^),  the  algorithm  would  choose  d^  as  the 
direction  for  (x, &)  while  d^  would  be  the  direction  in  which  to 
continue  to  produce  a consistent  movement  along  F”^(0).  Any  time 
d0/dt  = 0 along  F”^(0),  Algorithm  1.1  is  going  to  have  difficulties. 
What  is  needed  is  a way  to  determine  orientation  which  is  not  dependent 
upon  a notion  of  monotonicity  in  any  variable. 


I 

j 


I 


I 
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In  the  next  section  we  will  prove  the  necessary  result  concern- 


! 


,1 

! 

i 


I 


ing  orientation.  It  says  that  the  sign  of  x^,  for  some  i £ n (or 
0)  is  determined  by  the  sign  of  the  determinant  of  the  matrix  formed 
by  removing  the  colxann  corresponding  to  x^  (or  0)  from  F'(x, 0). 

In  the  following  section  we  shall  describe  an  algorithm  which 
uses  the  orientation  result  of  Section  2 and  returns  to  the  curve  f”^(0) 
periodically  to  prevent  the  situation  of  Figure  1.2  from  occurring  if 
the  stepsize  and  tolerance  is  small  enough. 


III. 2.  The  Orientation  of  Subdivided  Complexes  and  Curve  Index 

Before  discussing  the  orientation  of  paths  we  must  extend  the 

notion  of  oriented  manifolds  with  boundary  (cf.  Section  II. 2)  to  a 

definition  of  an  oriented  subdivided  m-complex. 

Consider  a cell  ct  with  an  orientation  cu  defined  for  each 

point  in  the  smooth  manifold  cP . For  any  point  in  where  t is  a 

facet  of  CT,  define  the  induced  orientation  of  t exactly  as  if  x 

0 2 

was  contained  in  where  E a is  a C -manifold  with  the  same 

orientation  as  cr. 

Let  (M,  7?})  be  a subdivided  m-manifold  with  m > 1,  and  each 
m-cell  of  9^  is  oriented.  Let  x be  a facet  contained  in  two  m-cells 
cr^  and  If  x receives  an  opposite  orientation  from  and 

o'2>  we  say  that  (U,7)J)  is  oriented.  When  we  say  that  is 

positively  oriented  we  shall  refer  to  the  orientation  of  each  m-cell  as 
being  positive. 
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Example  2.1.  Oriented  subdivided  complexes. 

Notice  that  if  (M,  is  a subdivided  2-complex  that  a 
"clockwise"  or  "counterclockwise"  property  of  the  ordered  bases  determine 


the  orientation  of  a cell.  If,  in  rotating  from  b^  to  b^  in  the 


shortest  direction,  one  rotates  in  a clockwise  direction  for  all  a € 
then  (M,977)  is  oriented. 


If  M is  an  m-complex  in  IR  we  shall  by  convention  give 


it  the  standard  orientation  cjd  . That  is,  each  ra-cell  is  oriented 

m 


by  It  is  easy  to  verify  that  is  oriented  in  this  case. 

Next  we  discuss  the  important  concept  of  curve  index.  The 
development  in  this  section  is,  sigain,  a generalization  of  the  results 
on  curve  index  in  Eaves  [ 1976] . It  is  the  last  result  in  this  section. 
Proposition  2.3,  which  allows  us  to  decide  which  direction  to  move 
in  a discrete  path  following  algorithm. 

Let  F ; M ->  N by  a map  from  the  oriented  (n+l)-complex  (M,^) 
to  the  oriented  n-coraplex  (N,  9^).  Let  y be  a good  value,  (W,2J) 
be  the  1-complex  F ^(y),  eind  (W,  7<r)  have  an  orientation.  Let  a 


J 


be  an  (n+l)-cell  of  containing  x,  [v]  be  an  orientation  of 
(TW)^  in  a,  and  [B,v]  orient  cr^  with  the  same  orientation  as  M. 
Let  T be  the  n-cell  of  9j  containing  F(a),  C orient  (Tt)  and 

y 

be  any  matrix  with  C^C  = I.  The  curve  index  of  x is  defined 


to  be 


sgn(det  C^F'(x)B)  . 


The  curve  index  is  the  sign  of  the  determinant  of  the  following 


set  of  maps 


F’  (x) 


->  Tt 


if  and  N c then  € E “ ^ F' (x)  e E ^ 


BSE 


Lemma  2.2.  The  curve  index  is  well-defined,  nonzero  and  constant 


on  any  oriented  path. 


Proof.  Part  1)  shows  that  the  curve  index  is  independent  of  the 
choice  of  B and  C,  Part  2)  shows  that  it  is  independent  of  x € W. 
First  we  note  that  for  points  a in  Tt^,  a = CX  for  some  X £ E^, 


C'^a  = C'^CA  = A , 

so  , 

CC  a = CA  = a . (2.1) 

If  we  let  G also  satisfy  GC  = I,  we  have 

det  c'^f'(x)B  = det  aCC'*’Df(x)B  = det  G f ’ (x)B  , (2.2) 

the  last  equality  following  from  (2.1).  Next,  let  and 

represent  orientations  of  Ter  and  To,  respectively,  for  i =1,2, 

X y 

where  = (B^,v^)A,  = C^E(det  A,  det  B > 0)  and  [v^]  = 

[v^]  = (TW^).  Clearly, 

sgn  det  Cg  F’  (x)B2  = sgii  det  C^F'  (x)B2  , 

using  = I,  det  E"^  > 0,  and  (2.2). 

Note  that  since  F is  constant  along  W,  F' (x)v^  = F' (x)v2  = 0. 

Hence, 

(F’ (x)B^D^,0)  = F* (x) (B^,v^)D  = F' (x) (B2,V2) 

= (F'(x)B2,0) 

«»-  „ "A 

°’U  ^/’ 

and  is  n x n.  But  F'(x)B^d2  = 0 implies  d^  = 0 (F’(x)B^ 

has  full  column  rank),  [v^]  = [v^]  implies  v^  = ov^  for  a > 0 
so  we  have  dj^  > 0.  Hence,  det  > 0 and  sgn  det  C2F'(x)B2 
= sgn  det  C^F*(x)B^  so  the  curve  index  is  well  defined  at  x. 
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Next  we  show  that  the  curve  index  is  constant  for  any  x G W fl  or^. 
We  can  choose  the  basis  B as  a continuous  function  of  x,  B(x),  along 
W because  B(x)  is  a choice  of  a linearly  independent  basis  in  Ta^ 
and  cP  is  a smooth  manifold.  can  be  constajit,  hence  d(x)  = 

det  C^F’ (x)B(x)  is  a continuous  function  of  x.  Since  we  have  shown 
that  the  curve  index  is  nonzero  at  some  point  x I W H Suppose  some 

point  X £ W has  opposite  curve  index  to  x.  Then 

det  c'^F'(x)  B(x)  • det  c'^F'(x)B(x)  < 0. 

Using  a parametrization  cp(t)  of  W,  there  must  be  some  t £ (t,t), 

\rtiere  x = cp(t)  and  x = cp(t),  such  that 

det  C'^F’ (cp(t°))  B(cp(t°))  =0  , 

has  row  rank  n and  B(cp(t®))  has  column  rank  n by  definition, 
so  F’  (^p(t*^) ) has  rank  less  than  n.  This  contradicts  the  fact  that 
y is  a regular  value  of  F and  F(cp(t^))  = y. 

If  cr  ax  is  unique  we  have  shown  that  the  curve  index  is 
constant  on  o)  n cP . If  x lies  in  more  them  one  cell,  then  it  lies 
in  exactly  two  (n+1) -cells  of  9^7  > because  y is  a good 

value  for  F w.r.t,  (M,  97)*  Let  t = C\  and  choose  an  orientation 
[B]  of  Tt  . Let  v^  G T(W  H q-  ) and  v £ T(W  D a ) be  chosen 
consistently  with  the  orientation  of  (W,ai).  Then,  since  0 is  a 
regular  value  for  f|^,  (B,Vj^)  span  To^^  smd  B can  be  chosen  so 
that  (B,v^]  is  the  orientation  of 

UO 


Due  to  the  definition  of 


an  oriented  subdivided  manifold,  B has  the  opposite  orientation  to  that 


induced  on  x by  But,  since  is  an  "inward"  vector  for 

it  follows  that  the  curve  index  is  constant  and  well  defined  on 
oriented  1-complexes.  | 

The  notion  of  curve  index  relates  the  behavior  of  the  curve  W 
11  to  the  Jacobian  of  nap  F which  defines  W.  The  next  Theorem  will  be 

[I 

used  repeatedly  for  the  definition  of  algorithms  and  for  proving 
theoretical  results. 

Suppose  that  M c ]R  , y is  a good  value  for  F w. r.t. 

(M,  and  r c F ^(y)  is  a path  oriented  so  that  its  index  is 
i = + 1.  Let  (x, 0)  = 9(t)  be  the  direction  of  r at  (x,0),  and  let 
F' (x)  = (E,e)  where  E £ 

Theorem  2.3.  sgn  « = i-sgn  det  E. 


r 


and  we  have  i-sgn  det  E = 1 > 0. 
If  6 < 0,  then  let 


(B,b) 


I X 


0 0 


where  I = (e^, ...,e^  -e”)  so  that  (B,b)  represents  the  standard 

orientation  in  E We  have 


i = sgn  det  i(E, e)B  = - sgn  det(E,e)B  = - sgn  det  E , 

smd 

i • det  E < 0 . 


Thus,  we  have  the  important  property  that  the  sign  of  h chajiges 
as  the  sign  of  the  determinant  of  E.  Clearly,  a simileir  statement  holds 
for  any  coordinate  of  (x, 6)  and  not  Just  the  last  one. 

The  curve  index  is  essentially  arbitrary,  so  we  must  choose 
it  in  such  a way  that  our  algorithms  will  move  in  the  desired  direction 
along  the  path  r-  We  are  always  given  an  initial  point  {^,0^)  £ 5cr 
for  some  cell  a = {x|bj^(x)  <0,  i £ m)  and  we  want  to  follow  F~^(0) 
into  the  cell  a.  Thus  if  we  are  (x,0)  in  the  null  space  of  F'(x^,0*^) 
we  know  which  sign  to  give  (x, 9)  so  that  it  points  into  a.  Knowledge 
of  9^  and  sgn  det(dj^F(x^,  0®) ) along  with  Theorem  2.3  let  us  define  i, 
the  curve  index  in  the  appropriate  way, 

i = sgn  9 • sgn  det(^^F(x*^,  9*^) ) . 


1+2 


I 


1 


. 

« 

After  i is  determined  at  (x^,d^)  £ dM,  Theorem  2.3  can  be  used  to 
determine  the  appropriate  sign  for  (x,6). 

Next  we  give  explicit  directions  for  implementing  Theorem  2.3 
in  a path  following  algorithm.  The  numerically  sophisticated  reader 
may  find  the  exposition  above  to  be  a sufficient  description  and  may 
want  to  skip  to  the  next  section. 

' In  this  section  to  avoid  singling  out  any  component  we  consider 

a problem  of  following  F ^(O)  across  a cell  a = {x|b^(x)  < 0,  i £ m) 
where  a a and  F maps  a into  Assume  that 

£ f”^(0)  n ba  satisfies 

b^(x^)  = 0 , h^(x*^)  < 0 , i 1. 

Let  F\(x)  be  the  matrix  F' (x) . , ^ . 

First  we  will  discuss  the  problem  of  finding  a nonzero  element 
of  the  null  space  of  F'(x).  We  are  ass\aming  that  0 is  a regular  value 
for  F,  so  that  in  some  nei^borhood  of  F”^(0),  F*  (x)  has  rank  n. 

Thus,  some  n x n submatrix  of  F' (x)  has  full  rank.  To  find  a 
V ^ 0 such  that  F*(x)v  = 0,  first  try  to  solve 

F’(5)y  = - F’(5).,^  (3.1) 

for  J = n+1.  If  is  singular,  let  j = j-1  and  again  try 

to  solve  (3.1).  Eventually,  a j will  be  foiind  such  that  F^(x) 
is  nonsingular.  Then  v defined  by 

I 

t 


k 


Yi  , 


^ = 


for  i < j 
1 , for  i = J 


i-r 


for  i > j 


satisfies  F'(x)v  = 0.  In  the  algorithm  below  the  phrase,  "find  v 
such  that  F' (x)v  = 0,"  means  that  a procedure  such  as  the  above  is 
to  be  followed. 


The  easiest  way  to  calculate  sgn  det  F'. (x)  is  to  keep  track 

(3 


of  certain  things  as  the  system  (3.l)  is  being  solved.  A common  (and 
efficient)  method  for  solving  a system  of  equations  is  to  develop  a 


LU  decomposition  of  F'. (x)  using  Gaussian  elimination  with  partial 

(] 


pivoting.  In  such  an  algorithm  one  can  determine  sgn  det  Fj(x) 


= sgn  det  LU  by  the  following  procedure 


0.  let  k = 1 

1.  each  time  two  rows  of  U are  interchanged  let  k = -k 

2.  each  time  a negative  diagonal  element  is  calculated 


let  k = -k. 


The  result  k will  be  the  sgn(det  F'.  (x))  because  L is  lower  tri- 


angular with  ones  along  the  diagonal  and  U is  upper  triangular. 

Thus,  in  the  sequel  when  we  say  "compute  sgn  det  E,"  we  mean 
that  a procedure  similar  to  that  above  should  be  followed  and  no  real 
work  need  be  done. 

We  siammarize  this  process  in  algorithmic  form.  We  first  show 
how  the  initial  unit  tangent  to  F ^(O)  pointing  into  a is  found. 


LU 
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0.  Given  such  that  x*^  c a = [x|b^(x)  <0,  i £ ra]  and 

b^(x°)  = 0. 

1.  Find  V such  that  F' (x^)v  =0  by  solving  a system  such  as  (5.1) 
and  let  i'  = sgn(det  F'. (x*^)), 

2.  Set  6 = -sgn(Vbj^(x^),v) . 

5.  Let  V = 5 • ^ , 

4.  Save  i = i' • 6. 


Now  we  have  the  curve  index  i and  the  initial  tangent  to  f”^(0) 

for  a curve  following  algorithm  to  move  along.  At  some  other  point 
k -1 

x near  the  path  F (O)  the  procedure  is  as  follows. 

1.  Find  V such  that  F’ (x^)v  = 0 and  let  i'  = sgn(det  F*.  (x^)). 

2.  Let  6 = i*i’ . 

5.  Let  V = B . ^ . 


III. 3.  Proving  Convergence  for  a Path  Following  Algorithm 

In  this  section  we  define  an  algorithm  which  will  follow  a path 

ni*^l 

across  a compact  cell  a c;  ]R  . Let  C be  the  connected  component 

of  F ^(O)  which  contains  a boundary  point  ^ of  F~^(0),  where 
rn  2 

F : a ->  ]R  is  C . The  algorithm  will  stay  near  the  curve  C euid 
converge  to  the  opposite  boundary  point  of  C,  x*  £ 3(r.  A 2-dimensional 

example  is  given  below. 
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Let  the  cell  be  defined  as  o s {x  £ E°^^|b^(x)  <0,  i £ ^} . 

We  assume  that  0 is  a good  value  for  F (see  Theorem  II. 5. 15). 

The  algorithm  is  essentially  a procedure  for  moving  a fixed 
distance  along  a tangent  to  a curve  near  C.  Then  a hyperplane  normal 
to  the  tangent  is  used  to  get  a fully  determined  system  of  equations. 
Newton*  s method  is  used  to  solve  this  system  approximately  until  we 
a^ain  have  a point  s'lfficiently  close  to  C.  This  subroutine  is 
repeated  until  the  opposite  boundary  has  been  reached.  At  this  point 
the  constraint  which  is  violated  is  augmented  to  F so  that  Newton’ s 
method  can  be  used  to  determine  the  boundary  point  x*  to  the  desired 
tolerance. 

A more  explicit  description  of  the  algorithm  is  given  below. 

The  theoretical  and  computational  question  of  interest  is  the  determine 
tion  of  T > 0,  the  steplength  along  the  tangential  approximation,  and 
€ > 0 the  termination  criterion  for  the  Newton  subroutines. 
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Path-Following  Algorithm  3.2. 

0.  The  initial  information  consists  of  positive  scalars  t,  e 
and  the  initial  point  Xq  ^ £ F ^(O)  fl  da.  k : = 0. 

1,  Determine  u(xj^  q),  the  tangent  to  f"^(0)  at  x^^  ^ so  that 
||u(xj^  q)||  =1  and  the  direction  of  u(Xj^  q)  is  consistent  with 
the  direction  of  movement  along  F ^(O). 


2.  Define  -»  to  be  F augmented  with  the  function 

iC 

defining  the  hyperplane  normal  to  u(xj^  q)  at 

F(x) 


GJx)  = 


£ :=  1 


\,1  ~ ^ 


3.  Use  Newton's  method  to  solve  Gj^(x)  = 0;  Calculate 


until  < £. 

Step  1.  Otherwise, 

/V 

5.  Let  i be  the  index  such  that  > 0,  := 


k ;=  k+1,  ajid  £ :=  1.  Define 


Gjx)  = 
hi 


F(x) 


^b^(x) 


6.  Use  Newton's  method  to  solve  0-  Iterate 

^k,i+l  " " i = 1,2,...  , 

until  i|G(xj^  ^+1  approximate  endpoint  x*. 

We  shall  show  that  for  some  choice  of  positive  scalars  x and  e, 

the  Jacobians  G' ( • ) are  of  full  rank  and  after  a finite  niunber  of 
k 

Newton  iterations,  an  approximate  solution  will  be  found. 

Notice  that  it  is  assiamed  in  Step  5 that  there  is  only  one 

constraint  i which  is  violated  by  the  first  x^^  ^ which  is  not  in 

cr.  We  shall  show  that  it  is  possible  to  choose  x and  e small  enough 

so  that  this  will  be  the  case.  However,  in  any  practical  implementation, 

strategies  are  available  which  should  be  used  to  designate  a particular 

constraint  if  more  than  one  constraint  is  violated  by  x . Such 

U 

strategies  were  not  included  in  the  algorithm  in  order  to  make  the 
convergence  proof  simpler. 

Example  $.3.  The  subproblem  in  Steps  1-U:  (the  subscript  k is  dropped) 


1+8 


Since  the  algorithm  uses  Newton' s method  to  calculate  a point 
in  the  intersection  of  F ^(O)  OH  and  £ H,  it  can  be  shown  that 
X , £ = 2,3,  ...  will  all  be  in  the  hyperplane  H. 

Af 

We  shall  first  prove  that  for  any  x^  £ B(C,e),  x^  = Xq  + tu(Xq) 
is  in  a neighborhood  of  y = F ^(O)  D H small  enough  so  that  the 
Newton  sequence  x^,  x^,  x^,  . . . converges  to  y.  First,  we  require 
some  definitions  and  preliminary  results. 

To  help  us  determine  when  certain  matrices  are  invertible  we 
need  the  following  lemmas  whose  proofs  are  in  Ortega  and  Rheinboldt 
[1970,  p.  45-46], 

Lemma  3. 4.  (Perturbation  Lemma).  Let  A,  C £ ^ and  assume  that 

A is  invertible,  with  ||A  ^||  < a (||a||  = sup  i|Ax|l).  If  ||A-C|1  < 6 

l|x||  = 1 

and  pa  < 1,  then  C is  invertible,  and 

||C"^||  < a/d-op)  . 

Lemma  3.3.  Suppose  that  the  mapping  A:D  c E ^ is  continuous 
at  a point  x*^  £ D for  which  A(x^)  is  invertible.  Then  there  is  a 
6 > 0 and  a y > 0 so  that  A(x)  is  invertible,  and 

j|A(x)l|~^  < Y,  for  any  x £ D fl  B(x^,5)  . 

We  also  use  the  following  mean  value  theorem: 
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Theorem  3.6.  Let  FtDcE^-^E”  be  continuously  differentiable  on 
a convex  set  c D and  suppose  that  for  constants  a > 0 and  p > 0, 
F'  satisfies 

||F'(u)  - F-(v)||  < a||u-v||P  , Vu,  V e Dq  . 

Then  for  any  x,  y G 

||F(y)  - F(x)  -F’(x)(y-x)||  < [a/(p+l)]  ||y-x|P^^  . 

Proof.  (Ortega-Rheinboldt  [1970],  p.  73). 

The  next  result  is  contained  in  the  proof  of  Kellogg,  Li 
and  Yorke' s Theorem  2.1  [I976]. 

Lemma  3.7.  There  is  an  open  set  V containing  F"^(0)  and  a continuous 
vector  function  u(x)  ^ 0,  x e V,  such  that  F' (x)  u(x)  =0,  x € V. 

For  our  purposes,  since  o and  C sire  compact,  we  can  find 
a ^ > 0 such  that  we  can  let  V =3  B(C,^)  z)  C,  and  for  x € B(C,^), 
u(x)  ^0  is  defined  and  F'(x)  u(x)  = 0,  x £ B(C,^) 

n n n 

Definiton  3.8.  Let  IR  be  the  product  space  ]R  x • • • x 3R  vdiere 

n^  + • • • + n^  = n,  and  denote  the  elements  of  K by  x = (x^,  ...,x^) 
i R tn 

with  X e ]R  , i e £.  Let  F = D c ]R  IR  and,  for  given 
X = (x^,...,x^)  e D,  set 


[|  aF(x)=F'(x^) 

ii  ^ ^ 


at  X with  respect  to  E ^ if  x^  £ int  D.  and  F.  has  an  F-derivative 

1 1 

at  x^. 

The  implicit  function  theoran  is  given  for  the  case  of  a con- 

2 

tinuous  partial  derivative  since  we  usually  assume  that  F is  C , 

Implicit  Function  Theorem  3.9-  Suppose  that  FtDcE^x 
is  continuous  on  an  open  neighborhood  Dq  c D of  a point  (x*^,y*^) 
for  which  F(x*^,y*^)  = 0.  Assume  that  d^F  is  continuous  and  non- 
singular at  (x°,y°)  and  that  it  exists  on  D^.  Then  there  exist 
open  neighborhoods  c;  E^  and  Sg  c E^  of  x*^  and  , 

respectively,  such  that,  for  any  ye  the  equation  F(x,y)  =0 
has  a unique  solution  x = H(y)  £ S,  and  the  mapping  H = E° 

is  continuous.  Moreover,  if  exists  at  (x*^,y*^),  then  H is 

F-differentiable  at  y*^  and 

H’(y°)  = -[SiF(x°,y°)]"^  ^2^(x°,y®). 

Proof.  (Ortega  and  Rheinboldt  [197O],  pp.  128-129.) 
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The  following  lemma  is  concerned  with  the  existence  of  a solution 


I 


I 


1 

!' 

[ 

i 

I 


► 


for  each  of  the  subproblems  in  Steps  1-4. 


Lemma  3.10.  Consider  the  function  h:a  X [0,t]  such  that  if 


G(v,x,t)  = 


F(v) 

(u(x),(x  + tu(x) ) - v) 


then  h(x,t)  satisfies  G(h(x, t),x,t)  =0,  for  t€  [0,t],  x€  B(c,e). 

There  exists  scaleurs  t'  >0  and  e'  >0  such  that  h exists 
and  is  continuous  and  3^G(h(x,t) ,x,t)  is  invertible  for  x G B(C,e') 
and  t G [0, t' ] . 

Proof.  Consider  the  point  (x^,  x^,  0),  x^  € C. 

d^G(x°,x*^,  0)  = 

Since  u(x)  is  continuous,  and  F is  C , S^G  is  continuous  at 
(x^,  x^,  0)  and  it  is  nonsingular  because  F' (x^)y  = 0 implies 
y = cai(x*^)  for  some  a G 3R  . The  implicit  function  theorem  5.8,  then 
gives  us  that  the  equation  G(v,x,t)  has  a unique  solution  v = h(x,t) 
for  (x,t)  in  a neighborhood  x T^  of  (x*^,  0)  and  h:S  x I -> 
is  continuous.  We  also  know  that  since  h(x^,0)  = iP , c)^G  is  continuous, 
and  d^G(x^,  P,  0)  is  nonsingular  that  there  is  a neighborhood 
S X I of  (x  , 0)  such  that  d^G(h(x,t),x,t)  is  nonsingular  for 


/ F'(x°)  \ 
\u(x°)  / 


52 


(x,t)  e X Let  S X I = (S^  X I^)  n (S^  X I^')  , 

Since  € C was  arbitrary,  we  can  index  S x I by  x (£  C 
euid  conclude  that 

U (S  X I)  3 C X {0} 
x€  C ^ 

is  an  open  cover  for  C x {0].  Since  C x (0}  is  compact  in  e”  x IR. 
there  is  a finite  subcover  K = [ (S  x I) i = 1, , . . ,p)  of  C x {0}. 

Let  d = inf{d((x,0) , (y, s) ) (x  £ C,  (y, s)  ^ K)  then  d is  positive 
because  otherwise  by  the  continuity  of  d,  there  is  some  (x,0)  £ C x {0} 
which  is  a cluster  point  for  a sequence  of  points  vdiich  are  not  in  K, 
but  (x,0)  is  in  (S  x I)^  for  some  i £ £ and  we  have  a contradiction. 
Since  d is  positive,  we  can  choose  an  e'  > 0 and  t'  > 0 
such  that  h(x,t)  is  continuous  on  B(C, e')  x and 

3^G(h(x,t) ,x,t)  is  nonsingular  for  (x,t)  £ B(C,€')  X [ -t  ,t  ].  | 


FIGURE  5.11.  A Depiction  of  h(x,t) 


Next  we  prove  the  main  result  of  this  section,  which  allows 


us  to  move  along  the  curve  any  finite  distance. 

2 2 

Theorem  3.12:  € ajid  t >0  can  be  chosen  so  that  in  Algorithm  1, 

after  a finite  number  of  repetitions  of  Step  3,  there  is  a k such 
that  4 o. 

Proof.  By  Lemma  3.10,  dj^G(h(x,t) ,x,t)  is  continuous  and  nonsingular 
for  all  (x,t)  e B(c,6')  X [0,t']  (=Bx  I).  By  Lemma  3.5 
S^G(h(x,t),x,t)  is  continuous  for  (x,t)  € B x I,  and,  since  B x I 
is  compact,  there  is  a p < + » such  that 

l!3^G(h(x,t),x,t)  II  < p V (x,t)  € B X I . 

Let  D be  sin  open  set  containing  a such  that  the  extension  of  F to 

— 2 — 

D,  F,  IS  C and  0 is  a regular  value  for  F.  Let  c D be 

siny  compact  set  such  that  C c intCD^);  then  S^G  is  uniformly  continuous 
on  Dq  X I,  and,  hence,  for  e e (O,  l/2P)  there  is  a 6 > 0 for 
which  B(h(x,t),6)  c:  for  all  (x,t)  € B x I and 

ll^3^G(y,x,t)  - S^G(z,x,t)||  < e 

(5.2) 

V y,  z € Dq,  ||y-z||  <6,  (x,6)  € B x I . 

Therefore  the  perturbation  lenmia  (3.^)  ensures  the  existence  of 

for  each  (x, t)  € B x I and  y £ B(h(x, t),6).  Moreover, 


we  have 


For  any  fixed  (x,t)  £ B X I,  consider,  now,  the  Newton  process 


- S^G(y^,x,t)“^  G(y^,x,t),  k = 0,1,,..  (3.4) 

with  y^  € B(h(x,t),6). 

Then  we  will  show  that 


l|y^  - h(x,t)l|  < o^,  k = 0,1,  ...  (5.5) 

where  a = Pe/(l  - (3e)  <1. 

(3.5)  is  true  by  assximption  for  k = 0,  and  if  it  holds  for 
some  k > 0,  then  y^  € B(h(x,t),8),  and,  hence,  by  (3.^),  (3.2),  (3.3) 
and  Theorem  3.6 

l|y^^^  - h(x,t)||  = ||y*^  - h(x,t)  - S^G(y^,x,t)"^  G(y^,x,t)|I 

< ||S^G(y^,x,t)||- ||G(h(x,t),x,t)  - G(y\x,t) 

- Sj^G(y^,x,t)  (h(x,t)  - y*^)!! 

< [p/(l-Pe)]e  ||h(x,t)  - y^ll 

< 8 . (3.6) 

(Note  that  Theorem  3.6  was  applied  with  a = e and  p = 0.) 

Therefore  the  bound  (5.5)  is  demonstrated  and  the  Newton 
sequence  (3.4)  remains  in  B(h(x,t),6)  and  converges  to  h(x,t). 


1 
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2 2 

It  remains  to  specify  an  e and  a x positive  so  that  for 

any  x € B(c,e),  x + tu(x)  = x 6 B(h(x,T),&).  Choose  e £ (0,e^)  so 

o 

that  for  any  x £ B(C,e  ),  d(x,h(x,0))  < 6,  This  is  clearly  possible 
because  for  x £ C,  d(x,h(x,0))  = 0 and  C is  a compact  set  and  both 
d and  h are  continuous  functions. 

Next  we  must  specify  x.  Consider  the  following  family  of 
sets  parametrized  by  t : 

= {x  £ B*^(C,6^)  |d(x,  s)  < &,  0 < s < t]  , t £ (O,  x* ) 


where  D(x,t)  = d(x  + tu(x),  h(x,t))  is  well-defined  and  continuous 
(Lemma  5.10  shows  that  h(x,t)  is  well-defined  and  continuous)  for 
X £ B°(C,€b,  t £ (0,x’).  is  an  open  set  by  the  continuity  of 

D(-,t).  For  any  x'  £ B(C,e^),  D(x,0)  < 6 by  the  definition  of  e. 

So  by  the  continuity  of  D(x, • ) there  is  an  interval  (0,t')  such 
that  for  s £ (0,t'),  D(x,s)  < 6 . Thus  3B(C,e^). 

By  compactness  there  is  a finite  subcover  if  A of  B(C,e^). 

1— -J_ 

Let  X = min{t.  |i  £ k),  then  A r>B(C,€^)  because  A 3A  for 

T T 

any  t.  > x by  the  definition  of  A . 

Now  all  that  is  left  to  do  is  show  that  we  csui  choose  a termina- 
tion tolerance  > 0 so  that  !|f(x)||  < and  x £ B(C,f^)  implies 
that  d(x,C)  < or  x £ B(C,e^). 

Define  the  function  y;  ]R  ^ IR  ^ as 


r(0!)  = max{d(x,C)|x  £ B(C,€^)  and  ||f(x)||  < a) 


(3.7) 
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r is  well  defined  because  d(*,C)  is  continuous  and  the  constraint 

set  is  compact.  Clearly  (p(0)  = 0 because  ||F(x)||  = 0 and 

X € B(C,p^)  means  that  x £ C because  was  chosen  so  that 

B(C,€^)  n F”^(0)  = C.  If  we  can  show  that  y is  right  continuous, 

3 2 

then  we  can  choose  an  e > 0 so  that  r(€  ) < e and  we  will  be  done. 

Let  {q!^  ] be  a sequence  such  that  linij^  = a and 

Let 

x^  -=  argmax{d(x,C)  |x  £ B(C,€^),  11f(x)11<q^}  , k = 1,2,  ...  . (3.8) 

k 1 

Then  since  x £ B(C, e ),  a compact  set,  there  is  some  subsequence 

- - 

{x  } which  converges  to  x . Let  x solve  (3.7)  for  a,  then 

K . 

since  a > a for  all  i, 

k, 

d(x  ^,C)  > d(x,C)  , £ = 1,2,  ...  , 

and 

d(x',C)  > d(x,C) 

by  the  continuity  of  d(*,C).  But 

liF(x')ll  <5  and  x'  £ B(C,6^) 
so 

d(x' ,C)  < d(x,C) 

and 

d(x',C)  = d(x,C)  . (3.9) 

Since  {r(o^ ’^))  is  a subsequence  of  {r(ct^))  and  >9(q:^^) 

i>y  (3.7),  we  have 
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lim  = lim  cp(a 

k i , 

k 

= lim  d(x  ,C) 
i 

= d(x',C) 

= d(x,C) 

= 9(a)  . 

r is  ri^t  continuous  at  zero,  in  particular,  so  we  can  choose 
> 0 so  that  r(e^)  < e^. 

Now  we  must  show  that  we  really  are  making  progress  along  the 
curve.  Since  C is  compact  we  can  parametrize  it  by  path  length. 

If  C = {x|x  = c?)(t),  t £ [0,T],  ||9(t)||  = 1).  Then  we  say  that  C 
has  length  T. 

We  can  measure  our  movement  along  C during  the  k^^  subproblem 
by  d(h(x^_^^Q,T),  h(x^^Q,T)).  For  simplicity  let  = h(Xj^  q,  t) 

^^^k,o- 

Since  ||u(Xj^)||  = 1,  d(Xj^,Xj^)  = t,  where  \ = \+  '^^^k-l^* 

Since  ^ (x|(u(xj^),  3^-x)  = 0)  = H(x)  and  x^  is  the  solution 

of  ) d(Xj^,x),  we  have  d(xj^,  > t.  Also,by  the 

Ic 

termination  criterion, 


d(xj^,yk)  < e , 


so 


<i(yk’yk+l^  > - d(xj^,yj^) 

> T - e . 
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So  choose  e = min(e^,T/2)  and  > ^/2.  If 

Yq  = X®,  then  for  an  integer  K > T-2/t,  > T,  implying  that 

Yjj  ^ C.  By  definition  y^^  e F"^(0).  In  order  to  conclude  that  there 
is  some  Is.  such  that  x^  £ 0 and  ^ o we  must  guarantee  that 

there  is  some  yj^^^  £ 0. 

Suppose  that  A(x' ) = j,  then  since  0 is  a good  value 

F'(x') 
bj(x') 

has  rank  n+1.  We  are  concerned  with  avoiding  the  following  situation. 


Thus  if  cp(T)  = x*  then  b .(cp(t))  is  a function  from  ]R  ^ into 

J 


TR^  and  ^bj((p(t)) 


>0  by  the  nonsingularity  of 
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at  X'  and  the  fact  that  bj(cp(T'))  < 0.  Thus  there  is  an  interval 
(T,  T + y)  over  vdiich  b.(cp(t))  > 0 and  hence  q)(t)  a for 
t e (T,  T + r). 

To  insure  that  there  is  some  £ {y|y  = cp(t),  t £ (T,  T + y)) 
we  first  need  to  get  an  upper  bound  on  d(y  , y ) . Remember  y 

iC'r  X JL 

= h(x^, t)  and  6 is  the  radius  of  the  ball  around  y^^  which 

+ Tu(Xj^)  must  be  in.  We  have  shown  that  no  matter  how  small 
6 > 0 is,  e and  t can  be  chosen  so  that  x^^  € So  if 

we  choose  6,  €,  and  t so  that  5 + e + t < y then 

d(yk»yk+i)  < ^ ^ ^ < r. 


Then  there  is  some  k such  that  y^^  G a,  yj^+j^  ^ Clearly  we  can  | 

choose  e small  enough  so  that  llxj^+3_  - yjj+]^ll  ^ implies  that  ! 

\ ^ Vi  ^ ■ i 

i 

1 

Next  we  show  that  termination  will  occur  after  a finite  number 
of  repetitions  of  Step  6.  j 

Theorem  3.13.  e,  t,  and  6 > 0 can  be  chosen  so  that  the  path  ] 

1 

following  algorithm  will  compute  an  approximate  solution  to  x*  after  | 

a finite  number  of  steps.  | 

i 
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Proof.  Let  be  the  facet  of  a containing  x*.  Let  y > 0 

be  the  radius  of  the  domain  of  attraction  for  x*  using  Newton' s 
method  on 


Let  Y < Y be  the  radius  of  a ball  around  x'  such  that  C nB(x*, y)  is  connected, 

and  if  x = C D dB(x*,y)  then  d(c\  B(x',y),  B^)  = d(x,B^),  that  is, 

the  minimum  distance  from  the  curve  outside  of  the  y-neighborhood  of 

X*  is  the  facet  B^  is  attained  at  the  intersection  of  the  curve  C 

smd  the  boundary  of  the  ball.  Such  a y exists  by  the  continuity 

compactness  of  C and  B^.  Let  9 = d(x,B^). 

We  want  to  show  that  the  first  point  in  the  sequence  {x  ) 

i+1  £+1 

which  is  not  in  a,  call  it  x , is  such  that  x £ B(x*,y), 
and  hence  Newton's  method  in  Step  6 will  converge  to  x*. 

0 0J.*|  p p 

So  X £ 0,  and  x ^ a.  Now  x £ B(y  ,6)  and 

X £ B(y  ,6)  but  there  are  three  cases  to  consider  depending  on 

il  i+1 

whether  y and  y are  within  or  outside  of  o.  We  will  choose 
€,  T,  6 > 0 so  that  y+e+T+6<y  and  e < y. 

Case  1.  y £ o.  Then  d(y  ,x  ) < c means  that  d(y  ,x*)  < y. 

So,  we  conclude 

d(x  ,x*)  < e + y < y . 


6l 


Case  2.  y ^ a,  y t a. 

By  part  of  the  proof  of  Theorem  5.12  (pg.  12)  for  any  t £ [0,t], 
d(x'^  + tuCx"^),  h(x'^,t))  < 6 . 

Z Jlf  £ ^+1_ 

Now  since  h(x  ,0)  = y £ a and  h(x  = y ^ a,  an  intermediate 
value  argument  along  the  curve  C would  show  that  there  is  some 
s £ [0,t]  such  that  h(x'^,s)  = x*.  (See  Figure  3.l4, ) 


FIGUBE  3.14 

Let  x^  = x"^  + suCx"^).  Then  d(x®,x*)  < & and  d(x^'*'\x®)  < x. 
By  the  triangle  inequality 

d(x  ,x*)  < 6 + T < y . 


Case  3.  y o.  This  implies  that 


so 


X € B(x*,f) 


d(x'^^^,x*)  < f + d(x'^,x'^^^) 


<Y'  + T+  5<y. 


Hence,  in  all  cases  d(x'^^^,x*)  < y and  hence  x'^^^  is  in  the  domain 
of  attraction  of  x*  for  the  iterative  process  in  Step  6.  || 


From  the  large  number  of  times  we  used  worlds  like  "if  € 
is  chosen  small  enough"  one  may  get  the  impression  that  the  tolerances 
must  be  chosen  so  small  that  the  algorithm  would  be  grossly  inefficient. 
In  fact,  from  o\ir  computational  experience,  quite  large  stepsizes  can 
be  chosen  and  the  algorithm  still  converges. 
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CHAPTER  IV 


PATH-FOLLOWING  METHODS 


In  the  first  two  sections  of  this  chapter  we  prove  some  theorems 
concerning  the  existence  of  paths  which  are  closely  related  to  the  paths 
of  Kellogg,  Li,  and  Yorke  [1976]  and  S.  Smale  [19^6].  In  the  remainder 
of  the  chapter,  two  new  path  methods  are  discussed  which  are  closely 
related  to  the  paths  defined  by  certain  fixed  point  algorithms. 


I 


II. 1.  Kellogg,  Li,  and  Yorke' s Continuation  Method. 

Let  a be  a bounded,  convex  cell  in  ]R  and  f • a -»  3R  ^ 

2 

a C function  which  satisfies 

Assumption  1.1.  f(x)  points  into  a for  all  points  x in  the  boundary 
of  a. 

Define  the  solution  set  E = {x  € a|f(x)  = 0).  Let 
h : a \ E ->  do  be  denoted  as 

h(x)  = X - u (x)’f(x)  , 

where 

n(x)  = {^  > 0|x  - nf(x)  e aa)  . (1.1) 

Note  that  |i(x)  is  well  defined  because  f(x)  ^ 0 for  x ^ E, 
a is  convex,  and  the  boundary  condition  (Assumption  1.1)  implies 
that  for  X € ba,  n(x)  = 0.  Hence,  h is  the  identity  on  the  boundeu'y 
of  a.  The  function  h is  often  called  a retraction  of  the  set  a\  E 

6U 


1 


j 

1 

1 


4 


to  the  boundary  of  a.  This  method  is  concerned  with  the  path  h ^(x*^) 
where  is  some  point  on  the  boundary  of  a.  This  path  method  was 


||  motivated  by  Hirsh's  [I965]  proof  that  there  is  no  continuous  retraction 

of  a set  to  its  boundary, 

Ij  Intuitively,  h”^(x*^)  is  the  set  of  points  x for  which  -f(x) 

ji  0 

j points  at  x . Figure  1.2  depicts  this  behavior. 

li 

;l 

I 


FIGURE  1.2  ; 

< 

The  main  result  of  Kellogg,  Li,  and  Yorke  [ 1976]  can  be 
stated  as  i 

Theorem  1.^.  For  almost  every  x^  £ 3a  the  set  h ^(x^)  consists  of 
a number  of  components,  each  one  of  which  is  a diffeomorphic  ima^e  of 
a circle  or  interval.  Furthermore,  for  any  e > 0,  the  connected 
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component  y c h ^(x*^)  leading  from  is  such  that 

r n B(E,e)  ^ 9 . 

In  other  words,  y gets  arbitrarily  close  to  some  zero  of  f. 
The  proof  of  Kellogg,  Li,  and  Yorke  used  only  results  from  advanced 
calculus,  and,  hence,  it  was  quite  long.  We  shall  prove  a very  similar 
result  using  the  theory  of  Chapter  II.  The  new  result  will  be  just  as 
useful  for  computational  purposes. 

Let  I be  the  closed  interval  [ -1, 1]  and  let 
F:M  = axI->IR'^  be  defined 

F(x,0)  = 0f(x)  - (l-e)(x-x°)  (1.2) 

for  some  x®  € and  -r  is  some  facet  of  cr.  We  shall  refer  to  the 
use  of  this  deformation  to  define  paths  as  the  homotopy  retraction 
method. 

If  (x, 0)  e F“^(0)  for  some  0 < 1,  then  x £ h~^(x®) 

because 

^(x)  - (1-0)  (x-x*^)  = 0 

implies 


m(x)  = and  h(x)  = x*^  . 

A nice  feature  of  our  deformation  is  that  F(x, 0)  is  defined  for 
X £ E while  h(x)  is  not.  Clearly  (x, 1)  € F ^(O)  implies  that 
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f(x)  =0  or  X £ E.  Next,  we  have  the  result  which  is  essentially 
equivalent  to  Theorem  1.3. 


Theorem  l.U.  If  Assumption  1.1  holds  and  0 is  a good  value  of 
F:M  then  there  is  a unique  point  (x*^,6^)  £ D F ^(O)  and  = 0. 

The  component  y c F ^(O)  containing  (x^,0)  has  an  opposite 
boundary  point  (x*,l).  Also,  x*-  £ E and  |e|  is  odd. 

2 

Proof.  Since  f is  C , and  0 is  a good  value.  Theorem  1.3.15 
gives  us  that  F ^(O)  is  a 1-manifold  with  boundary,  neat  in  M. 

The  reason  that  I = [-1,1]  was  chosen  for  the  domain  of  d 
rather  than  I'  = [0,1]  is  that  [x^,0]  would  be  in  two  facets  of 
o X I'  which  would  make  it  impossible  for  0 to  be  a good  value. 

Consider  solutions  to 

f(x,  e)  = ef(x)  - (i-e)(x-x*^)  = o 

0 < 1 (1.3) 

(x, 0)  £ 

It  is  clear  that  F ^(O)  n (a  x {0})  = (x^,0).  Parametrize  y 
by  cp  = [0,T]  -» M,  such  that  cp(0)  - (x*^,0),  cp(T)  £ 3m,  and  cp(t)  £ 
t £ (0,T).  If  we  can  write  cp(t)  = (x(t),0(t)),  then  0(O)  > 0 would 
imply  0(t)  > 0,  ^ £ (0,T]  because  if  not,  then  0(0)  = 0,  0(€)  = 0 
implies  that  r is  not  neat  in  M,  a contradiction. 

The  orientation  of  cp(t)  is  determined  by  guarsmteeing  that 
x(0)  point  into  a.  But  (x(O),0(O))  = Q!  ‘(v, 1)  where  a is  chosen 
so  that  ocv  points  into  0,  and 
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I 

I 


f 


F’ (x,e) 


(x°,  e) 


or 


[0-f’(x)  - (l-0)l|f(x)  + (x-x°)] 


(x’^,0) 


(I)  =0 


-Iv  = -f(x°)  , 
V = f(x°)  . 


Since  f(x^)  points  into  a,  a > 0,  and,  hence, 
and  0 > 0 for  (x, 0)  € r. 

Suppose  (x',0')  satisfies  (1.2)  for  (0,1). 

x'  € do,  and  f (x' ) ^ 0,  and 


1 


I 


X* 


0(0)  > 0, 


Then 


(l.M 


f(x' ) 0 implies  x'  ^ x^.  Now  x'  € do,  f(x' ) points  into  o, 

and  x*^  € do  means  that  (l.^)  is  a contradiction  of  the  convexity 
of  o.  Hence,  there  is  a unique  solution  to  (1.5)  when  0 > 0. 

We  have  shown  that  dr  \ {(x^,0)}  c {(x, 0)|(x, 0)  € dM,  0 > 0, 

X ^ do)  which  means  that  dr  \ {(x°,0))  = (x*,l)  and  f(x*)  =0. 

Suppose  there  is  euiother  point  x'  € E,  then  (x’,1)  e F“^(0)  n dM. 
Suppose  that  r*  F ^(O)  is  the  connected  component  containing  (x’,1). 

If  we  can  show  that  for  any  (x, 0)  e rS  0 > 0,  then,  by  the  arguments 
above.  dr*\  {(x',1))  = ((x^,l))  and  f(x^)  = 0.  Suppose  there  is 
some  (x, 0)  £ r*  with  0 < 0,  then,  by  continuity,  there  is  some 


•■j 
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(x,5)  £ y*  with  9 = 0,  which  implies  x = x^.  But  r'  IT  and 
F~^(0)  is  a neat  submanifold,  a contradiction. 

Thus  we  have  shown  that  each  solution  x'  ^ x*  is  connected 
by  some  component  (Fig.  1.4)  of  f”^(0)  to  one  other  solution.  Since 
0 is  a good  value  and  M is  compact,  there  are  a finite  number  of 
elements  in  E and  the  theorem  is  proved.  | 


(x\l) 


(x^l) 


O X (1) 


\ 

/ / 


cr  X {0} 


(x0,0) 


0-  X {-!) 


FIGURE  1.4 


Kellogg,  Li,  and  Yorke  [1976,  p.  478]  explore  some  conditions 
which  imply  that  x^  will  be  guaranteed  to  be  a g^od  value  for  k. 
One  of  these  conditions  waus  the  following 
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Assumption  1.^.  (Eigenvalue  condition)  Suppose  that  the  matrix 
f’ (x)  has  no  eigenvalues  which  lie  on  (O,®)  for  any  x G a\  E. 

The  result  which  follows  immediately  is 


Theorem  1.6.  Let  f ; c -»  I?  and  suppose  assumptions  1.1  and  1.5  are 
true.  Then  each  x^  on  a part  of  do  is  a regular  value  of 

h,  and  the  curve  starting  at  x*^  is  well  defined  and  goes  to  the 


set  E. 


Kellogg,  Li  and  Yorke  make  the  conjecture  that  assumption  1.5 


implies  that  E has  only  one  connected  component  which  can  be  joined 
to  do  by  a path  in  h”^(x*^).  If  we  assume  a little  more,  i.e.,  that 
f (x)  has  no  eigenvalues  which  lie  in  [0,®)  for  any  x in  a,  then 
using  Hopf's  theorem  on  the  index  of  vector  fields  [Milnor,  1965], 
one  can  show  that  f has  a imique  root  in  o.  We  shall  prove  this 
result  and  a result  concerning  the  monotonicity  of  paths  with  this 
stronger  assumption  using  the  orientation  result  of  Section  III. 2. 


n 2 

Theorem  1.7.  Let  f:a  ->]R  be  C , assumption  1.1  hold,  and  for 
every  x G a,  f ' (x)  has  no  eigenvalue  which  lies  on  [0,®).  Then 
f(x)  =0,  X G a has  a unique  solution,  and  for  any  x^  G do 
defining  F,  there  is  one  component  in  F ^(O).  Furthermore,  if 
F ^(0)  = {(x,e)|(x,0)  = (x(t),0(t)),  tG  [0,T]),  vrtiere 
(x(0),e(0))  = (x^,0)  and  (x(T),0(T))  = (x*,l),  then  ^(t)  >0 
for  any  t G [0,T] . 
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1 

Proof.  We  will  prove  the  last  statement  first.  From  the  proof  of 
i Theorem  1.4  we  have  that  0(0)  > 0 and  that  P{t)  > 0 for  any  t > 0. 

Thus,  we  only  need  to  show  that  h{t)  > 0 for  t e [0,T]. 

Recall  that 

F'(x,0)  = [ef’(x)  - (l-0)l|f(x)  + (x-x°)] 

= [E(x) |e(x)]. 

Any  eigenvalue  of  E(x)  has  the  form  &h  - (1-0)  < 0 where  A < 0 

is  an  eigenvalue  of  f'(x).  Hence,  E(x)  is  never  singular  for  any 

X € o.  This  means  that  0(t)  ^ 0 for  any  t € (0,T]  by  Theorem  III. 2. 3. 

Hence,  by  the  continuity  of  0(t),  &(t)  > 0 for  any  t € [0,T]. 

Suppose  there  was  another  component  y c.  F ^(O),  then  by 

studying  the  proof  of  Theorem  1,4,  we  see  that  y would  have  boundary 
12  12 

points  at  (x  ,1),  (x  ,1)  where  x and  x are  in  E.  Setting  up 
a parametrization  of  y,  (x(t),5(t))  = [0,T]  -> M it  is  clear  that  for 
some  t e (0,T),  d0(t)/dt  = 0 which  implies  that  det  E(x(t))  = 0, 
a contradiction.  Hence,  F ^(O)  has  a unique  connected  component. 

Similar  considerations  would  show  that  x*  is  the  only  solution  of 
f(x)  = 0,  X e a.  I 

Thus  we  have  shown  that  Assiamption  1.5  is  so  strong  that  the 
homotopy  parameter  0 is  guaranteed  to  increase  monotonically  from 
zero  to  one.  The  literature  of  continuation  methods  for  solving 
equations  commonly  iMikes  assumptions  necessary  to  insure  that  the 
homotopy  parameter  increases  monotonically  along  the  path  (Avila  [1974])  . 
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However,  it  is  well  known  that  monotonic  behavior  by  any  variable  can 
rarely  be  depended  upon  in  equation  solving  problems  (Eaves  and  Scarf, 
[1976]).  The  path  following  algorithm  of  Section  III. 5 does  not  assume 
monotonicity  in  any  variable.  The  orientation  results  of  Section  III. 2 
are  what  allow  us  to  follow  paths  which  are  not  monotone  in  any  variable. 


IV.2.  The  Global  Newton  Method 

In  this  section,  we  discuss  a path  method  that  satisfies  the 

k 

differential  equation  version  of  the  Newton  recursion  x - x = 
f ’ (x^)  ^ f(x^),  k = 0,1, ...  , except  for  a factor  of  + 1 which  is 
determined  by  the  determinant  of  f ’ (x) . To  be  precise,  if  x(t)  was 
a parametrization  of  the  path,  then  x(t)  is  specified  by  the  differ- 
ential equation 

f"(x)  ^ = -X(x)  f(x)  , X G D (2.0) 


\^ere  X is  an  arbitrary  scalar  function  of  x such  that  sign  X(x)  = 

+ sign  det  Df(x). 

This  method  was  first  described  in  a paper  by  Smale  [1976].  The 
global  Newton  method  is  a differentiable  analogue  of  Scarf's  fixed  point 
algorithm  [1965]  and  Eaves'  "vector-labelling"  algorithm  [I969]. 

Varian  [1977]  described  a method  which  allows  the  path  to  move  in  and 
out  of  D.  In  the  next  section  we  will  present  a generalization  of 
Varian' s method  and  prove  that  the  path  produced  by  this  algorithm  is 
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1 


identical  to  the  path  produced  by  Eaves'  algorithm  in  the  limit  as  the 
mesh  of  the  triangulation  goes  to  zero. 


Once  again,  we  are  to  find  x € D c=  ]R^  such  that  f(x)  = 0, 

o 

where  D is  a boxmded  cell  with  one  facet  3d.  Suppose  that  f is  C 
and  the  following  boundary  condition  holds: 

Assumption  2.1.  For  x € dD,  det  f ' (x)  ^ 0 and  there  is  a choice, 

(a)  sign  ^(x)  = sign  det  f'(x),  all  x € Bd,  or  (b)  sign  >y,(x)  = 

- sign  det  f'(x),  all  x € dD,  which  makes  -A(x)  f'(x)”^  f(x)  point 
into  D at  each  x € dD. 

Again,  let  E = {x  € D|f(x)  = 0).  Then  we  can  define  the  C‘^ 
map  g = D \ E -4  as 

g(x)  = f(x)/||f(x)||  . (2.1) 

So  g maps  every  point  in  D which  is  not  a zero  of  f to  a point  on 
the  unit  sphere  in  Smale's  main  result  was 

n 2 

Theorem  2.2.  Let  f:D  -»  E be  C and  satisfy  Assumption  2.1.  Then 
for  almost  every  x*^  € ^D,  there  exists  a C curve  cp  = -»  D 

with  ||9(t)||  = 1,  and  t^  maximal,  t^  < » . Also  if  d = g(x*^), 
g”^(d)  = (x|x  = cp(t),  t € [tQ,t^)}  and  cp  converges  to  E as  t 
He  also  proves  a compsuiion  theorem  with  the  added  assumption 
that  0 is  a regular  value  of  f with  the  stronger  result  that  cp 
converges  to  x*  as  t -♦  t^^  and  x*  G E.  We  shall  reformulate  the 
problem  and  prove  a result  similar  to  the  companion  theorem. 
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g ^(d)  is  the  set  of  points  x for  which  f(x)  points  in  the  direction  d. 
Note  that  g"^(d)  does  not  include  its  "boundary  points"  which  are 
elements  of  E. 

Let  F:Dx3R^— »]r'^  be  defined 

F(x,0)  = f(x)  - 0d  , (2.2) 

where  d = f (x°) /||f  (x°)  1|  for  some  point  x*^  € dD.  Then  (x,  0)  € F"^(0) 
for  0 > 0 implies  f(x)/0  = d euid  since  d has  norm  1,  0 = ||f(x)||. 

Thus  xGg“^(d).  Clearly  x € g"^(d)  implies  (x, 0)eF"^(o)  for 
6 = ||f(x)||.  In  this  case,  we  have  solved  the  problem  when  we  find 
(x,0)  e F'^(O). 

Since  D is  con^jact,  we  can  define  K = max  l!f(x)||,  and  K<oo. 

xGD 


7^ 


Pick  an  L € (K,<»).  Then  M = D x [0,L]  is  the  domain  of  interest 
for  F. 


Theorem  2.3.  If  0 £ is  a good  value  for  F and  Assumption  2.1 

holds,  then  the  path  y <z  F”^(e)  beginning  at  (x*^,0^)  has  another 
endpoint  (x*,0)  such  that  f(x*)  = 0. 


Clearly  (x^,  0^)  solve  (2.3)  because  x*^  £ dM  and  Assumption  2.1 
implies  that  f(x^)  ^ 0 and  so  6^  = ||f(x^)||  > 0.  No  point  (x,  0)  £ F”^(0) 
can  have  0 = L because  otherwise  |lf(x)||  = L > K contradicting  the 
definition  of  K. 

Suppose  there  is  an  (x,0)  £ y such  that  x £ Sd,  0 £ (0,L). 

Let  (x(t),0(t))  be  a parametrization  of  y by  arclength  with 
(x(O),0(O))  = Then  (x(t),0(t))  is  the  tangent  of  y at 

(x(t),  0(t))  in  the  direction  away  from  (x^,0^).  To  compute 
(x(O),0(O))  we  choose  v £ such  that 

(a)  V points  into  D 

(b)  f (x)v  = A(x(0) )d 

(c)  A(x)  = sgn  det  Df(x^),  WLOG 
Then 
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1 


(i(o),a(o))  = 


v,a(x°); 


(WLOG  means  that  if  (v, A(x(0)))  does  not  point  into  M for  this 
definition,  we  can  define  A(x)  = -sgn  det  f'(x).)  Assumption  2.1 
implies  that  the  solution  v of  (b)  is  unique.  As  we  move  along 
(x(t),0(t))  let  t such  that  (x(t),0(t))  = (x, 0)  be  the  first  t > 0 
such  that  (x(t),0(t))  £ dD  x (0,L).  Then  x(t)  does  not  point  into  D 


n 


and  there  is  a v £ 3R  such  that 


= (x(t),0(t)) 


(v,A(x))|| 


and 


f’(x)v  = A(x)d  = A(x) 

l|f(x)| 


because  x £ g ^(d).  But 


V = f f(x) 

Ilf  (x)  II 


does  not  point  into  D and 


V = f(x°) 


does  point  into  D,  a contradiction  of  Assumption  2.1.  Thus  there  is 
no  point  in  y other  than  {yP,6^)  which  satisfies  (2.3)  so  by 
Corollary  II. 3. 19  the  other  boundary  point  (x*,  0^)  of  r must 
0*-  = 0,  and,  therefore,  f(x*)  = 0,  x £ D.  | 
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1 


The  proof  indicates  the  connection  between  the  curve  g ^(d) 
and  the  differential  equation  (2.0).  Clearly  the  path  following  algorithm 
could  be  used  if  D was  defined  as  {x|b^(x)  <0,  i (£  m)  for  some 
smooth  functions  b^,  i £ m and  F was  defined  as  in  (2.2). 


IV.3.  The  Strong  Path  Method 

i 

In  this  section  we  define  a deformation  F for  which  one  can  ! 

■^ollow  F ^(O)  to  find  a solution  of  f(x)  =0,  x G D,  with  quite 
weak  boundary  conditions  on  D.  We  shall  then  show  that  Varian' s j 

method  [1977]  is  in  some  sense  a special  case  of  our  method,  and  we  , ; 

j ■ 

shall  use  a result  of  Friedenfelds  [1976]  to  show  that  the  path  we  ! 

define  is  the  same  as  the  limiting  path  of  Eaves'  [19('l]  vector  j 

labelling  algorithm  as  the  mesh  size  goes  to  zero.  i 

■ ’ 

Let  D c ]R  ” be  a cell  defined  as  in  Chapter  I,  | 

i 

D = [x|b.(x)  <0,  i € m] . Clearly  D has  a nonempty  interior  because 

^ I i 

of  the  constraint  qualification  on  the  b.'s.  In  this  section  we  make  ' j 

^ ! I 

the  following  ! j 

Assumption  3.1-  There  is  some  interior  point  c £ D such  that  for  any 
X £ SD,  there  is  no  a > 0 such  that  Qf(x)  = x-c. 

If  0 = 8"^  and  c = 0,  then  the  assumption  states  that  there  ’ 


is  no  X £ ^ such  that  f(x)  points  radially  outwards.  If  the 

condition  fails  for  f we  could  test  it  for  -f  and  solve  for 
-f(x)  = 0 so  that  a weaker  form  of  3.1  could  be  given. 


r 


h 


Assumption  3.2-  There  is  some  c £ D such  that  there  are  no 

X,  y G Sd  for  which  there  are  scalars  3 < 0 < a such  that 
Pf(x)  = x-c,  af(y)  = y-c. 


Next  we  define  the  deformation  FiM  where  (M,  is  a 

subdivided  (n+l)-con5)lex. 


F(x,  0,o)  0f(x)  + o(c-x)  - (l-0)d  , 


(3.1) 


where  xGD,  O<0<1,  o>0.  For  the  moment  we  define 


m 


in  which 


and 


CTq  = { (x, 0,o) |x  e D,  0 € I,  p = 0}  , 

= { (x,  0,o)  |x  G 0 G I,  p > 0)  , 

Ti  = {x|bi(x)  = 0,  b^  < 0,  J ^ i), 

I = [0,1]. 


i G m, 
i G m. 


When  X is  in  D , then  (x, 0,  p)  G M implies  that  p = 0 


and  F is  essentially  the  deformation  defined  for  the  Global  Newton 
1-1 


method  (2.2)',  F 


(O)  ^ (x,0,O)  for  0 < 1 implies  that 


f(x)  = 0/(l-O)d  or  f(x)  points  in  the  same  direction  as  d. 

When  X G 9d,  then  (x, 0,p)  G M is  still  in  an  n+1  cell  because 
0 is  allowed  to  increase  from  zero.  Note  that  any  facet  of  both  cr^ 
and  Pq  for  some  i £ m is  defined  by  the  fact  that  p is  identically 
zero  on  that  facet. 
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Assumption  3.3.  c and  d £ E are  chosen  so  that  the  ray 


r = {y|y  = c - od,  a > 0}  intersects  dD  in  only  one  point  {x°}, 
and  is  in  only  one  facet  of  D. 

Example  3.4.  a)  fails  to  satisfy  Assumption  3.3. 

d 


h)  satisfies  Assumptions  3.3  and  3.1. 


Define  for  some  set  S c M to  be  {x|(x,  6,0)  € S) . 

Points  in  P^(F  ^(O))  n dD  have  the  property  that  d is  in  the  cone 
spanned  by  f(x)  and  (c-x),  i.e.,  if  0 < 1, 

Tie  = d 

>0,  p > 0 . 

In  order  to  apply  the  theory  of  Chapter  II  we  must  put  an  upper 
bound  on  p to  make  each  of  the  cells  and,  hence,  the  subdivided 
complex  M compact  sets. 

Lemma  3.6.  There  is  a Q such  that  (x, 0,o)  € F"^(0)  implies  that 


Proof.  Since  o s 0 for  points  (x, 0,o)  € F ^(O)  vrtiere  x £ D^. 

Ic  k "1  i 

Assiame  there  is  a sequence  {x  , 0^,  p ] c F (O)  such  that  x € 

for  all  k and  for  any  Q > 0,  there  is  a k such  that  p“  > Q. 


’(x^)  + (c-x^)  - d = 0 


Since  (x^, 0^)  £ SD  x I,  a compact  set,  and  f is  continuous. 


0^f(x^)  (1-0^) 

lim  •— z=  0 = lim  V ■ ^ / . 

k-»a>  p k-»<»p 


This  implies  that  lim  c - x^  = 0,  -vdiich  is  impossible  because  c is 
k->  00 

an  interior  point  for  D.  | 


Thus  we  can  let  J s [o,Q]  and  redefine  cr^  = x I x J, 
i £ m,  and  let  (M,^)  be  redefined  accordingly. 

Theorem  3.7.  Given  Assumptions  3-1  and  3.3  and  that  0 is  a good 
value  for  F with  respect  to  (M,^),  the  path  y c F"^(0)  with 
(x°,0,o*^)  (x  £ 3d,  p°  = ||d||/||c-x*^||)  as  one  boundary  point  has  an 
opposite  boundary  point  (x*, 1,0)  and  x*  £ E = (x  £ D|f(x)  = 0). 
Furthermore  |e|  is  odd. 


Proof.  Clearly,  (yp.Q,(P)  c,  3M  fl  F ^(O)  by  the  definition  of  jp 
in  Assumption  3-3.  (P  is  positive,  so  {P,0,p)  is  in  only  one  facet 
of  the  cell  (remember,  x^  £ Xj^).  Since  0 is  a good  value,  there 
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-Jll. 


By  studying  the  defintion  of  one  notes  that 

(x,0,p)  e dM  means  either  G = 0 or  1,  or  o = Q-  By  the  definition 
of  Q,  the  latter  type  of  boundary  point  is  impossible. 


If  e = 0,  then  F(x, G,o)  = 0 inplies 

o(c-x)  =_d=4>  d>0=^x£^D,  ' { 


0 


By  Assumption  3.3,  x = x*^,  and  the  solution  to  (3.2)  is  unique. 

Thus,  the  other  endpoint  of  Y",  (x*,  ^,d*)  , must  have  G*-  = 1. 
Suppose  D*  > 0,  then  x*  £ dD  and 

f(x*)  = p*(x*-c)  , 

a contradiction  of  Assumption  3.1.  Hence,  p*  = 0 and  f(x*)  = 0. 

The  fact  that  |e|  is  odd  is  a simple  consequence  of  the 
following  facts:  |F  ^(O)  D dM|  is  even  (Prop.  1.3.18),  (3.2)  has 
a unique  solution,  • and  (x, l,o)  £ F“^(0)  implies  p = 0 and  x £ E.  | 
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The  assumption  that  0 is  a good  value  for  F with  respect  to 
removes  the  possibility  of  boiuidary  solutions,  i,e,,  x € SD 
such  that  f(x)  = 0.  In  this  case  (x,  1,0)  £ F~^(0)  n SM,  (x,  1,0)  £ cTq 
ajid  (x,  1,0)  £ for  some  i £ a contradiction  that  F ^(O)  meets 
only  the  interior  of  facets  of  cells  in  One  could  easily  relax 

the  regularity  conditions  on  Theorem  5.7  so  that  x*  could  be  a 
boundary  point  of  D,  but  the  oddness  of  |e|  could  not  be  guaranteed. 

IV. 4.  Varian' s Modification  of  the  Global  Newton  Method. 

Varian  [1977]  proposed  a modification  of  Smale's  method  >diich 
allowed  the  boundary  conditions  to  be  weakened  considerably.  We  shall 
show  how  Varisin' s modification  is  related  to  the  strong  path  method. 

Suppose  ftD*^  is  a function,  where  = B(0, 1) 

the  closed  unit  disc  in  ]r'^.  The  problem  is,  as  usual  to  find  x 
which  solves  f(x)  =0,  x £ D^.  Suppose  f satisfies  the  following 


i 

i 

I, 


Assumption  4.1.  At  all  x £ there  is  no  a > 0 such  that 

f(x)  = QX. 

It  is  clear  that  this  boundary  condition  implies  Assumption  3.1 
when  D = d"  and  c = 0 £ E 

Let  Dg  be  a disc  in  of  radius  2.  Let  s(x)  = i(x||-l 

and  define  the  following  function  on  D^t 

h(x)  = -s(x)'i|^  + (l-s(x))  1 < llxll  < 2 

= f(x)  , 0 < 11x11  < 1. 
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This  function  coincides  with  f on  and  is  a continuous 
extension  on  In  fact  h is  on  02X0"^  and  on  the 

interior  of  but  not  on 

We  must  define  a subdivided  con^Jlex  (M,  , where 

9^  = {D*^  X ]R  \d2\  X IR  , so  that  h is  continuous  on  M sind  on 

pieces  of  ^)fj  . The  deformation  is  identical  to  that  of  Smale  [ 1976] . 

F;M  is  denoted  by  F(x,0)  = h(x)  - Qd,  and  d = h(x^) /||h(x^)  || 

for  some  x^  £ 

For  X 6 h(x)  = -x/2  and  h' (x)  = -l/2;  hence  it  is  easy 

to  see  that  h satisfies  Assumption  2.1.  Since  is  compact,  we 

can  define  K = max  ||h(x)||,  and  pick  L > K.  Then 
xeD“ 

'(y  , (d"  X [o.t],  dJ  X [0,L]}  is  a compact  subdivided  complex, 
and  we  can  prove,  almost  immediately,  the 

Proposition  4.2.  If  0 is  a good  value  for  F with  respect  to  (M,  *^) 
and  Assumption  4.1  holds,  then  y c:  F ^(O)  beginning  at  (x*^,  9^)  € x (0,L) 
has  another  boundary  point  (x*,0)  such  that  f(x*)  = 0. 


Proof.  Since  Assumption  2.1  is  satisfied  for  h,  it  is  immediate  that 
(x*,6H<')  has  0*  = 0 by  the  proof  of  Theorem  2.5*  All  we  need  to  show 
is  that  X*  € because  then  F(x*,0)  = h(x*)  = f(x*)  = 0. 

Suppose  X*  is  in  DgX  If  llx*||  < 2,  then  1 - s(x*)  > 0, 

and 

h(x*)  = - s(x*)  + (l-s(x*))  f(u^)  = 0 


or 
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X*  \ _ s(x*)  X* 

llx*||  l-s(x*)  l|x*|| 


but  this  contradicts  Assumption  k. 1. 

If  ||x*||  = 2,  then  h(x*)  = -x*/2  ^ 0,  a contradiction.  Hence 
X*  € D*'.  I 


The  formulation  above  is  essentially  the  same  as  Varian' s, 
with  the  following  differences.  Varian  considers  gtD^lE  -♦ 
denoted  by  g(x)  = h(x) /||h(x)  ||  and  the  path  defined  by  g~^(d), 
vdiere  d is  defined  as  above.  The  regularity  assijmption  Varian  makes  . 
is  that  d is  a regular  value  for  g restricted  to  0°  \ g re- 
stricted to  and  g restricted  to  the  boundary  of  This  assump- 

tion implies  that  d is  a good  value  if  one  assumes  there  are  no 
zeros  of  f on  It  is  easy  to  see  that  {x|(x, 0)  € F (O) 

for  some  0 > 0)  \ E = g”^(d).  That  is,  the  two  formulations  provide 
essentially  identical  paths. 

Figure  4.3  provides  some  geometrical  insight  into  the  method. 


f(x*)  = 0 


FIGURE  4.3.  The  vectors  along  the  path  g”'^(d)  are  all  paurallel  to  each 
other. 
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The  relationship  of  Varian’ s method  to  the  strong  path  method 
is  that  if  the  portion  of  the  path  g”^(d)  contained  in  \ 
were  projected  onto  the  boundary  of  then  the  resulting  path  would 

he  identical  to  the  path  followed  by  the  strong  path  method  for 
in  the  x-variables.  The  following  proposition  makes  this  statement 
more  precise. 

Remember  the  deformation  for  the  path  method  (5.1) 

F(x, 0,d)  = 0f(x)  + p(c-x)  - (l-0)d  , 


defined  on  the  subdivided  (n+1) -complex  (M,  where 

{D^  X [0,1]  X 0,  SD°  X [0,1]  X [0,Q]},  where  Q > 0 is  chosen 
so  that  o < Q for  any  (x, 0,o)  £ F ^(O).  The  interior  point  c is 
chosen  as  the  origin  and  d = -x^/2  for  some  initial  point 


Proposition  U. 4.  The  following  sets  are  identical; 

= (g”^(d)  n d“)  U {x|x  = y/||y||,  y € (g"^(d)  n d”  \ D*^)} 

ajnd 

Proof.  (S^  <=  S^) . 

If  X £ (g  ^(d)  n D*^),  then  f(x)  7^  0 so  we  can  rule  out  0 = 1 
in  S^.  So  let  0/(l-0)  = l/|lf(x)||,  then  0f(x)  = (l-0)d  or 
(x, 0,0)  £ F"^(0)  implying  x £ S^. 
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The  path  prescribed  by  the  strong  path  method,  is  very  nearly. 


the  path  which  would  be  followed  by  some  fixed  point  algorithms  for 
solving  the  same  problem.  This  is  not  surprising  because  the  strong 
path  method  is  related  to  Smale's  [1976]  "Global  Newton  Method"  which 
was  a differential  version  of  Scarf’s  [1973]  fixed  point  algorithm 
stated  in  terms  of  solving  systans  of  equations. 

The  algorithm  we  shall  discuss  is  a vector  labelling  algorithm 
for  solving  Kakutani  fixed  points  developed  by  Eaves  [1971]  > and  dis- 
cussed by  Friedenfelds  [1976], 

For  a complete  description  of  this  algorithm  one  should  see  one 
of  the  above. 

Suppose  f is  a map  from  C to  C*,  the  set  of  all  convex 
subsets  of  C,  which  has  a closed  graph.  Then  Kakutani' s theorem  [19^1] 
states  that  there  is  an  x £ C such  that  x £ f (x) . Such  sui  x is 

called  a fixed  point  of  f.  Assume  that  C is  contained  in  the  hyper- 

n n 0 

plane  H = {x  G ]R^|  Z x.  = 1],  and  that  a point  c £ is  available. 

i=l  ^ 

Let  N be  an  arbitrary  positive  integer,  and  let  Tl  be  the  set  of 
points  in  H defined  as 

ITS  {x£  ]r”|x  = |jy>  vdiere  Z y.  = N,  and  y.  is  an  integer,  i£n] 

i=l 


Assume  that  H is  triangulated  by  Kuhn' s [ 1968]  method  with  IT 


We  will  define  a piecewise  linear  approximation  to  f below. 


First,  extend  f to  H =)  C by  defining  for  each  x £ H, 


f(x)  if  X £ int  C 

f(x)  = <(  conv[c,  f(x)]  if  X £ dC 
c if  X ^ C. 


The  extended  f is  a closed  map  taking  points  of  H into  convex  subsets 

of  C.  Furthermore,  the  fixed  points  of  the  extended  f are  precisely 

those  of  the  original  f on  the  domain  of  C (Friedenfelds  [1976],  p.  i6) . 

For  each  point  x £ IT,  define  the  function  g(x)  = y-x,  where 

y £ f'(x);  and  let  d ne  an  arbitrary  vector  in  such  that 

n _ n 

E d.  = 0.  Let  g;H  -»  {x|x  £ ]R”,  Z x.  = 0)  = be  the  continuous 
i=l  ^ i=l  ^ 

function  which  is  linear  on  each  ct  £ and  agrees  with  g at  the 
vertices  of  cr. 

The  fixed  point  algorithm  consists  of  following  the  path  g'^CO) 
where  G:H  -♦  is  defined  as 

Gjj(x,  e)  = g(x)  + ed  , and 

(M,  is  defined  by  = {a  X 3R^|ct  £ . Define, the  direction 

11*^1  1 

ray,  r,  as  r = {(x,r)  £ ]R  [x  = c + rd,  r € Assxime  the  points 

of  r are  such  that  no  point  of  r is  a convex  combination  of  less  than 
(n-1)  points  of  IT.  For  any  a £ we  say  that  a touches  r if 

(x,  d)  £ o'  cz  H \ C and 

(x, 0)  £ G”^(0)  then  cr  touches  r.  Friedenfelds  ([1976],  p.  1?) 
shows  that  if  one  begins  in  a cell  CTq  that  touches  r \ C,  and  moves 
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along  G ^(O)  towards  C,  after  passing  through  a finite  number  of 
cells,  one  arrives  at  a point  (x,0)  £ G ^(O)  imply  that  g(x)  = 0. 

X is  called  an  approximate  fixed  point  of  f.  Eaves  [1971]  shows  that, 

-N 

if  X is  the  approximate  fixed  point  for  the  algorithm  when  N is 

the  denominator  in  the  Kuhn  triangulation  of  H,  then  each  cluster 

-N 

point  of  {x  } is  a fixed  point  of  f. 

Define  the  collection  of  cells 


= {a  £ (M,^)jj|cr  n gJJ^(O)  / (p) 


The  set  of  almost-complete  points  is  defined  as 


A^  H {x  £ HjO  £ conv[?(x)-x,  d]) 
= {x  £ h|x  £ conv[f(x),  x+d]} 


In  terms  of  the  original  function,  f,  the  set,  A , within  C is  given  by 


A^nc=<x£C|x£  I 


conv[ f (x) ,x+d]  if  x £ int  C 


conv[ f (x),x+d, c]  if  x £ SC 


It  is  easy  to  check  that  everj'  sequence  of  cells  {ct  ] with 

N N 

a c C converges  to  A^  as  N -» ».  It  is  easy  to  check  that  the 
points  of  A^  outside  of  C are  precisely  those  on  the  direction  ray  r. 
For  any  N = L the  fixed  point  algorithm  can  be  started  at  some 
CTq  = conv[p^, p^]  c H N C.  Let  ' r^  be  an  arbitrary  point  in 
(Jq  D r.  Then  for  each  N > L,  the  algorithm  can  be  started  so  that 
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Pq  is  in  the  convex  hull  of  the  original  set.  We  call  the  initial 

almost-complete  point.  Application  of  the  fixed  point  algorithm  for  any 

N > L then  gives  us  a succession  of  points  of  H,  p, ,...,p  , p ,p  . 

— ^1'  '^n’  ^n+1’ 

where  r^  G conv[pj^, . . . ,p^];  conv[ p^^^, . . . , for  i = 0, 

1, ...,£-n;  and  conv[p^_^^^, . . . ,p^]  cor  'ins  an  approximate  fixed  point. 

Let  Pjj(^q)  - and  let  Pj^C^q)  be  the  continuous  path 

formed  by  joining  p^  to  i = 1,...,  ^-1.  We  call  P^Cr^)  the 

approximate  almost-complete  path  corresponding  to  N.  One  of  Friedenfelds 
[1976]  main  results  concerned  the  limiting  properties  of  Pjj(rQ). 

Let  = [A^  n conv[c,rQ])  U (A^  D C)— i.e.,  A^  is  A^  with  the  points  of 
ray  r further  from  C than  r^  excluded.  Let  PqC^q)  denote  the 
connected  component  (i.e.,  the  largest  connected  subset)  of  A^  which 
contains  r^. 


Theorem  ^.1.  (Friedenfelds  [ 1976] ) 

(i)  Po(ro)  is  a closed  set, 

(ii)  Pjj(rQ)  -^PQ(rQ)  as  N -> « 

(i.e.,  given  e > 0,  Pjj(rQ)  n B(PQ(rQ),€)  for  all  N 
s\JLfficiently  large). 


The  conclusion  that  ^0(^0)  is  a path  cannot  be  drawn  with 

the  assumption  that  f is  a closed  point  to  set  map.  However,  if 
2 

f is  C then  we  can  apply  the  results  of  Section  II. 5 to  show  that 
is  a path.'  In  fact  if  Pq(Xq)  = fl  C where  x^  = r D bC, 

then  P0(xq)  is  the  same  path  as  that  produced  by  the  strong  path 
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method,  projected  onto  H.  To  make  this  statement  precise  requires 
some  preliminaries. 


First,  since  the  strong  path  method  is  for  solving  systems  of  | 

5 

equations,  rather  than  finding  fixed  points,  we  will  be  solving  for  a 
zero  of 

g(x)  = f(x)  - X , X e C . 

Second,  for  simplicity,  we  will  define  the  subdivided  complex  on  which 
F(x,e,p)  = 0g(x)  + d(c-x)  - (1-0) (-d)  (5.2) 

i 

is  defined  as  (M,  ^)  where  i 

'yn  = {aj^^cr^}  = {c  X [0,1]  X 0,  3C  X [0,1]  x [0,Q])  i 

i 

! 

M = CT  U o-p  , I 

I 

i 

I 

where  Q > 0 is  chosen  as  in  Lemma  5.6  so  that  o < Q for  any  ' 

(x,0,p)  € F (0).  This  subdivision  ignores  the  fact  that  C must  be  1 

defined  by  a set  of  inequalities  for  any  practical  applications.  We 
use  -d  in  the  definition  of  F (5.2)  because  the  almost  complete 

path  is  the  set  of  points  for  which  g(x)  points  in  the  direction  j 

opposite  to  d (i.e.,  0 £ conv[g(x) , d] ) for  points  in  the  interior  1 

of  C.  I 

From  Theorem  3.8  we  have  that  if  0 is  a good  value  for  F, 
and  r"  G F ^(O)  is  the  path  containing  (x^,0,o^),  then  the  other 

boundary  point  of  x,  (x*,l,0),  is  such  that  g(x*)  = 0.  : 
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Proposition  ^.2.  Given  Assumptions  3.1  and  3.2,  we  have  0 C = P^(f”^(0)). 


Proof.  Suppose  that  x € P^(F“^(0)),  then  if  x e dC,  there  is  some 
pair  6,  p > 0 such  that 

fgCx)  + p(c-x)  = -(l-e)d  , 

d = 1 implies  that  p = 0 and  g(x)  = 0,  otherwise  x violates 
Assumption  3.1.  But  g(x)  = 0 x € A^.  If  0 < 1,  then  let 
a = P/(l-e)  > 0,  P = o/(l-0)  > 0 and  we  have 

a(f(x)  - x)  + P(c-x)  = - d, 

Qf(x)  + Pc  = (-d-x)  + (a  + p + l)x  . 

If  4 = a + P + 1,  then 

— f(x)  + ^ c + — (d+x)  = X , 
n UP 

which  means  that  x € conv[f(x),x+d,c].  Thus  x £ A^  0 dC  by  the 
definition  (5.1).  Clearly,  the  argument  can  be  reversed  to  show  that 
X e A^  n implies  that  x € P^(f”^(0))  because  any  choice  of 
Aj  such  that  E A^^  = 1,  A^^  > 0,  i £ 5 can  be  written, 
cc/w,  P/u,  l/u. 

If  X € P^(F’'^(0))  n int  C,  then  p must  be  zero,  and  there  is 
some  6 6 [0,1]  such  that 

0g(x)  + (l-e)d  = 0 
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If  0=1,  g(x)  = 0 and  x € D C.  If  0 < 1,  then  let  a = o/(l~e) 
and  we  have 


a(f(x)-x)  + d + X = X 

<==>  af(x)  + d + X = (l  + a)x 

<==>  there  exist  \ ^ ^ \f(x)  +7^(d+x)=x 

<==>  X e conv[f(x),d+x]. 

Thus,  by  (5.1),  x € fl  C.  This  proves  the  proposition.  | 

The  following  relates  the  theorem  to  the  paths  which  the 
algorithms  follow. 

Corollary  5-3.  ^x^^^  • 

We  have  shown  that  path  methods  essentially  follow  the  same  course 
that  a fixed  point  algorithm  follows.  One  might  ask  vdiat  the  point  is 
in  considering  path  methods  \dien  the  fixed  point  methods  are  more  robust 
and  do  not  require  differentiability  or  continuity  assun^tions.  It  is 
the  author's  contention  that  many  equation  solving  problems  deal  with 
differentiable  functions  which  are  sufficiently  well  behaved  so  that 
path  methods  can  process  them  faster  than  fixed  point  methods.  Our 
computational  results  in  Chapter  V,  Part  2 help  to  support  this  contention. 
Of  course,  fixed  point  methods  are  necessary  when  the  functions  are 
non-differentiable  or  map  points  into  sets. 
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IV. 6.  A Class  of  Path  Methods 

A number  of  fixed  point  algorithms  can  be  written  in  a rather 
general  form.  One  need  merely  specify  a triangulation  of  X (0,D], 

an  artificial  map  r:®^  -♦  for  labeling  K*'  x {D},  and  a method 

for  labeling  the  vertices  in  x (0,D)  using  r and  i(x)  = f(x)-x. 

In  particular,  the  algorithm  of  Eaves  and  Saigal  [1972]  chooses  r as 
a one-to-one  linear  map  with  a unique  point  Xq  such  that  rCx^)  = 0, 
and  a decreasing  map  a = (0,1]  -»  (0,<»)  such  that  a(l)  = 0,  a(t)  -»  + » 
as  t -»0.  Then  the  labeling  L of  a vertex  (x,t)  € x (0,D]  is 
defined  by 

n 

L(x,t)  = f(x)  - X if  a(t)  > E |x J 

i=l  ^ 

= r(x)  if  not  . 

' I 

t 

I j 

A geometric  interpretation  of  the  algorithm  is  to  let  F(x,t) 
be  the  continuous  extension  of  L(x,t)  which  is  linear  on  each  piece 
of  the  subdivision  of  x (0,D].  Then,  beginning  at  (xq,D), 

follow  F ^(O)  until  one  reaches  a point  (x*,t*)  where  t*  is  close 

t to  zero  or  until  ||x*||  > K for  some  large  constant  K.  The  former  ’ 

[ 

termination  means  that  x^*-  is  an  approximate  fixed  point,  the  latter 
means  that  the  algorithm  appears  to  have  failed. 

One  condition  for  success  is  contained  in 

I 

Theorem  6.1.  Let  C be  an  open  bounded  set  containing  x^  such  that 
i (x)  + pr(x)  ^ 0 for  all  o > 0 and  x € ^C.  Then  there  is  a D > 0 
such  that  the  algorithm  will  compute  a fixed  point  of  f. 
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Proof.  See  Saigal  [1976a],  for  example. 

Now  it  is  clear  that  an  analogous  class  of  path  methods  can 
be  defined  with  only  minor  changes.  Let 

r(x)  = Ax  - a 

for  some  nonsingular  matrix  A,  and  ftlR*'  is  the  f\inction  we 

are  trying  to  find  a zero  of.  Define  the  deformation 

F(x,0)  = 0r(x)  + (l-0)f(x)  , X e a,  0 e [0,1]  , (6.1) 

where  o is  a bounded  n-cell  in  Let  (a  X [0,1]}.  Then  if 

0 is  a good  value  for  F one  could  follow  the  path  y <=  F”^(0) 
containing  (Xq,1),  vrtiere  x^  = A ^a  £ cr^.  The  algorithm  would  terminate 
at  (x*,  0*’)  if  either  0 = 0 or  x*  £ Scr.  Since  r(x)  has  a unique 
zero,  it  is  Impossible  for  y to  hit  the  facet  aX  [1]. 

It  is  trivial  to  prove  the  following 

Theorem  6.2.  Suppose  that  there  is  no  x £ ba  such  that  f (x)  + Dr(x)  = 0 
for  some  o > 0.  Then  if  0 £ K*'  is  a good  value  for  F w. r.t. 

0 X [0,1],  the  path  y c F ^(O)  containing  (xq,1)  an  opposite 

boundary  point  (x^,0). 

Proof.  By  the  discussion  preceding  the  theorem,  we  need  only  show  that 
there  is  no  (x,  0)  £ y such  that  x £ Str.  If  there  was,  then 

(1-0)  f(x)  + 0r(x)  = 0 
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t 

for  some  0 < 1.  Hence,  if  o = 0/(l-0)  > 0,  then 
f(x)  + pr(x)  = 0 , X € do  , 

a contradiction.  |[ 

As  a special  case  of  Theorem  6.2  we  can  achieve  a path  method 
which  is  convergent  with  assumptions  just  as  weak  as  those  necessary 
for  the  strong  path  method  of  Section  3. 

Corollary  6.3.  Let  r(x)  = -x  + c for  some  c e o^.  Suppose  that 
Assumption  3.1  holds  and  that  0 is  a good  value  for  F as  defined 
in  (6.1),  then  y cz  T ^(O)  which  contains  (0,1)  leads  to  a zero  of  f. 

The  advantage  of  this  class  of  path  methods  is  that  the  starting  5 

I 

point  x^  is  not  a boundary  point  of  a.  So  if  a good  guess  to  a zero  j 

of  f is  known,  that  guess  can  be  the  starting  point. 

A useful  area  of  research  would  be  to  apply  the  results  of 
Saigal  [ 1976b]  to  the  path  methods  presented  in  this  section.  The 
results  contained  therein  suggest  that  if  tP  is  close  to  x*,  a 
root  of  f,  then  the  path  r c F"^(0)  will  be  shortest  if  A = f'(x*). 

Other  results  suggest  that  if  A”^f ' (x)  has  positive  real  eigenvalues 
for  points  x near  x*,  then  the  path  will  be  well  defined. 
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